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APPLICATION FOR PATENT 



Inventors: Ziv Shani and Oded Shoseyov 

Title: PROCESS OF EXPRESSING AND ISOLATING 

RECOMBINANT PROTEINS AND RECOMBINANT 
PROTEIN PRODUCTS FROM PLANTS, PLANT 
DERIVED TISSUES OR CULTURED PLANT CELLS 

FIELD AND BACKGROU ND OF THE IN VENTION 

The present invention relates to a process of expressing and isolating 
recombinant proteins and recombinant protein products from plants, plant 
derived tissue or cultured plant cells, which process exploits (i) the high 
affinity between cellulose binding peptides and cellulose; (ii) the inherent 
abundance of cellulose in planta; and/or (iii) the simplicity associated with 
cellulose isolation from plants, plant derived tissue and cultured plant cells. 

More particularly, the present invention relates to a process 
expressing and isolating recombinant proteins and recombinant protein 
products from plants, plant derived tissue or cultured plant cells, which 
process employs the expression of a fusion protein including a recombinant 
protein and a cellulose binding peptide fused thereto, plant homogenization, 
isolation of a fusion protein cellulosic matter complex and optional 
subsequent isolation of the fusion protein and/or the recombinant protein 
from the complex. The present invention further relates to nucleic acid 
molecules and to genetically modified or viral infected plants or plant cells 
which are useful while implementing the process, and further to a novel 
composition of matter which results from the process. 
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Citation or identification of any reference in this section or in any 
other section of this application shall not be construed as an admission that 
such reference is available as prior art to the present invention. 

With the advent of recombinant technology, techniques for the 

5 genetic transformation of various host organisms, such as bacteria, yeasts, 
fungi, plants and animals, for the purposes of producing specific proteins 
through the expression of heterologous or foreign genes have been 
extensively developed. 

Using these recombinant techniques and hosts, numerous 

10 commercially important recombinant proteins (examples of which are 
included hereinbelow) have been expressed and purified. Expression and 
isolation of a protein of interest on a commercial scale, neccesitate the 
selection of a suitable expression host. This selection largely depends on 
the economics of production and purification, as well as the ability of the 

15 host to accomplish the post-translational modifications needed for full 
biological activity of the recombinant protein. 

Much of the early work in biotechnology was directed toward the 
expression of recombinant or "heterologous" proteins in prokaryotes like 
Escherichia coli and Bacillus subtilis because of the relative ease of genetic 

20 manipulation, growth in batch culture and large-scale fermentation of 
prokaryotes. 

Although E. coli can in certain cases perform some post translational 
modifications and events, such as, protein folding and disulfide bond 
formation, it cannot secrete proteins extracellularly nor can it glycosylate, 
25 gamma carboxylate, beta hydroxylate, acetylate or process pre- and pro- 
peptides. B. subtilis suffers from the same limitations as E. coli except that 
it is capable of extracellular secretion. 

Furthermore, E. coli and other bacteria are pathogens and therefore, 
depending on the application, contaminants such as pyrogens and 



endotoxins expressed along with the recombinant protein must be removed. 
In addition, extensive post-purification chemical and enzymatic treatments 
(e.g., to refold the protein into an active form) are sometimes required in 
order to obtain a biologically active protein. 

Because proteins are not secreted from prokaryotes like E. coli, 
bacterial cells must be disrupted for product recovery. The subsequent 
release of bacterial contaminants and other proteins make product 
purification more difficult and expensive. Because purification accounts for 
up to 90 % of the total cost of producing recombinant proteins in bacteria, 
proteins like Insulin can cost several thousand dollars per gram when 
recombinantly produced in, and subsequently isolated from, E. coli. 

Because of the many limitations associated with prokaryotic hosts, 
the biotechnology industry has looked for eukaryotic host cultures such as, 
yeast, fungi, insect cells, and mammalian cell tissue culture, to properly and 
efficiently express recombinant proteins. 

For most of the proteins requiring extensive post-translational 
modifications for therapeutic and/or functional activity, mammalian cell 
culture is the most common alternative to E. coli. Although mammalian 
cells are capable of correctly folding and glycosylating bioactive proteins, 
the quality and extent of glycosylation can vary with different culture 
conditions among the same host cells. Furthermore, mammalian culture has 
extremely high fermentation costs (60-80% of total production expense), 
requires expensive media, and poses safety concerns from potential 
contamination by viruses and other pathogens. Yields are generally low and 
in the range of 0.5-1.5% of cellular protein, or micrograms per liter (up to 
300-400 milligrams per liter). 

Yeast, other fungi, and insect cells are currently being used as 
alternatives to mammalian cell culture. Yeast, however, produces 
incorrectly glycosylated proteins that have excessive mannose residues and 



are generally limited in eukaryotic processing. Further, although the 
baculovirus insect cell system can produce high levels of glycosylated 
proteins, these are typically not secreted, making purification complex and 
expensive. Fungi represent the best current system for high-volume, low- 
cost production of recombinant proteins, but they are not capable of 
expressing many target proteins. 

In addition, eukaryotic cultures, require the maintenance of suitable 
conditions for efficient commercially viable expression of proteins. As 
such, the ambient temperature, pH value and aeration level of such cultures 
need to be carefully controlled, while nutrients must be added to the culture 
medium in carefully regulated doses and waste products removed. In 
addition, rigorous aseptic practices must be observed in order to avoid 
contamination by extraneous microbes. Such cultures are thus normally 
grown in sophisticated fermentors or bioreactors which are housed in 
expensively maintained factories. Such overheads are reflected in the high 
price of the recombinant protein end-products. 

To a lesser extent, animals have also been utilized for the production 
of recombinant proteins. Although large amounts of protein can be 
produced and relatively easily recovered from such animals (e.g.* proteins 
specifically produced in mammary glands and secreted with the milk), 
production in such host is limited to the expression of proteins which do not 
interfere with the host physiology. In addition, transgenic animals are 
subject to lengthy lead times to develop herds with stable genetics, high 
operating costs, contamination by animal viruses and a relatively slow rate 
of biomass generation substantially prolonging the time period following 
which recovery of commercial amounts of the protein can be effected. 

The biochemical, technical and economic limitations on existing 
prokaryotic and eukaryotic expression systems has created substantial 



interest in developing new expression systems for the production of 
recombinant proteins. 

Plants represent the most likely alternative to existing expression 
systems. With the availability and on going development of plant 
transformation techniques, most commercially important plant species can 
now be genetically modified to express a variety of recombinant proteins. 

Such transformation techniques include, for example, the 
Agrobacterium vector system, which involves infection of the plant tissue 
with a bacterium (Agrobacterium) into which the foreign gene has been 
inserted. A number of methods for transforming plant cells with 
Agrobacterium are well known (Klee et al, Annu. Rev. Plant Physiol. 
(1987) 38:467-486; Schell and Vasil Academic Publishers, San Diego, 
Calif. (1989) p. 2-25; and Gatenby (1989) in Plant Biotechnology, eds. 
Kung, S. and Arntzen, C. J. 3 Butterworth Publishers, Boston, Mass. p. 93- 
112). 

The biolistic or particle gun method, which permits genetic material 
to be delivered directly into intact cells or tissues by bombarding 
regeneratable tissues, such as meristems or embryogenic callus, with DNA- 
coated microparticles has contributed to plant transformation simplicity and 
efficiency. The microparticles penetrate the plant cells and act as inert 
carriers of a genetic material to be introduced therein. Microprojectile 
bombardment of embryogenic suspension cultures has proven successful for 
the production of transgenic plants of a variety of species. Various 
parameters that influence DNA delivery by particle bombardment have 
been defined (Klein et al, Bio/Technology (1998) 6:559-563; McCabe et 
al., Bio/Technology (1998) 6:923-926; and Sanford, Physiol. Plant. (1990) 
79:206-209). 

Micropipette systems are also used for the delivery of foreign DNA 
into plants via microinjection (Neuhaus et al, Theor. Appl. Genet. (1987) 



75:30-36; and Neuhaus and Spangenberg, PhysioL Plant. (1990) 79:213- 
217). 

Other techniques developed to introduce foreign genes into plants 
include direct DNA uptake by plant tissue, or plant cell protoplasts (ScHell 
and Vasil (1987) Academic Publishers, San Diego, Calif, p. 52-68; and 
Toriyama et aL, Bio/Technology (1988) 6:1072-1074) or by germinating 
pollen (Chapman, Mantell and Daniels (1985) W. Longman, London, p. 
197-209; and Ohta, Proc. Natl. Acad. Sci. USA (1986) 83:715-719). 

DNA uptake induced by brief electric shock of plant cells has also 
been described (Zhang et aL, Plant. Cell. Rep. (1988) 7:379-384 and Fromm 
et aL, Nature (1986) 319:791-793). 

In addition, virus mediated plant transformation has also been 
extensively described. Transformation of plants using plant viruses is 
described, for example, in U.S. Pat. No. 4,855,237 (BGV), EP-A 67,553 
(TMV), Japanese Published Application No. 63-14693, EPA 194,809, EPA 
278,667, and Gluzman et aL, (1988) Communications in Molecular 
Biology: Viral Vectors, Cold Spring Harbor Laboratory, New York, pp. 
172-189. Pseudovirus particles for use in expressing foreign DNA in many 
hosts, including plants, have also been described , for reference, see, for 
example WO 87/06261. 

The production of recombinant proteins and peptides in plants has 
been investigated using a variety of approaches including transcriptional 
fusions using a strong constitutive plant promoter (e.g., from cauliflower 
mosaic virus, Sijmons et aL, Bio/Technology (1990) 8:217-221); 
transcriptional fusions with organ specific promoter sequences (Radke et 
al. } Theoret. Appl. Genet, (1988) 75:685-694); and translational fusions 
which require subsequent cleavage of a recombinant protein 
(Vanderkerckove et aL, Bio/Technology (1989) 7:929-932). 



The application of such genetic transformation techniques has 
allowed the incorporation of a variety of important genetic traits for crop 
improvement and also for the biotechnological production of extractable, 
valuable, foreign proteins including enzymes, vaccine proteins and 
antibodies. 

Foreign proteins that have been successfully expressed in plant cells 
include proteins from bacteria (Fraley et al Proc. Natl Acad. Sci. U.S A 
(1993) 80:4803-4807), animals (Misra and Gedamu, Theor. Appl. Genet 
(1989) 78:161-168), fungi and other plant species (Fraley et al Proc. Natl. 
Acad. Sci. U.S.A. (1983) 80:4803-4807). Some proteins, predominantly 
markers of DNA integration, have been expressed in specific cells and 
tissues including seeds (Sen Gupta-Gopalan et al Proc. Natl. Acad. Sci. 
U.S.A. (1985) 82:3320-3324; Radke et al Theor. Appl. Genet. (1988) 
75:685-694). 

Due to the advantageous economics of field-grown crops, the ability 
to synthesize proteins in storage organs like tubers, seeds, fruits and leaves 
and the ability of plants to perform many of the post-translational 
modifications previously described, several plant expression systems are 
currently investigated for potential as highly effective and economically 
feasible systems for the production of recombinant proteins. 

Since highly expressive systems such as the ubiquitin fusion system 
described in U.S. Pat. No. 5,773,705 have been demonstrated, a major 
hurdle to an effective plant expression system resides with the relatively 
complicated purification procedures necessary in order to purify the 
recombinant protein. 

As such, alternative expression approaches have been undertaken in 
an effort to simplify the purification procedure of the recombinant protein 
from the plant cells. 
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One such system focuses on the use of seed-storage protein 
promoters as a means of deriving seed-specific expression. Using such a 
system, Vanderkerckove et al, (Bio/Technol. (1989) 7:929-932) expressed 
the peptide Leu-enkephalin in seeds of Arabidopsis thaliana and Brassica 
napus. The level of expression of this peptide was quite low and it 
appeared that expression of this peptide was limited to endosperm tissue. 

Another system utilizing seeds as an expression host is disclosed in 
U.S. Pat. No. 5,888,789. This system provides for the secretion of 
heterologous protein by malting of monocot plant seeds. The heterologous 
genes are expressed during germination of the seeds and isolated from a 
malt. 

U.S. Pat. No. 5,580,768 describes a method of producing a 
genetically transformed fluid-producing plant. The genetically transformed 
plant which can be for example, a rubber secreting (Hevea) plant is capable 
of expressing the target product in the fluid that it produces which in this 
case is latex. 

U.S. Pat No. 5,650,554 describes the use of a class of genes called 
oil body protein genes, that have unique features, allowing the production of 
recombinant proteins that can be easily separated from other host cell 
components. 

Many additional expression systems have been described utilizing 
specific targeting or directing of recombinant proteins to specific plant 
tissues. 

Although systems which target or direct recombinant protein 
production to specific tissues allow for easier recombinant protein isolation 
such systems are typically limited in the effective host range and/or the 
amounts of recombinant proteins produced since such systems fail to exploit 
the entire plant biomass. 



A novel approach for simplifying the purification of recombinant 
enzymes from plant host cells is disclosed in U.S. Pat. No. 5,474,925 
which describes an expression construct utilizing a signal peptide 
translationally fused to a recombinant enzyme which targets the enzyme to 
the cellulose matrix of the cell wall. This enables the isolation of the 
enzyme along with the easily recoverable cellulose matrix. This system is 
utilized for the localized expression of commercially important enzymes in 
cotton fibers. According to this system, the expressed enzymes are 
recovered along with the cellulosic matter of the fibers. The enzyme- 
cellulose matrix recovered, is directly utilized for commercial enzymatic 
processes. 

Although this system presents a simple means with which a 
recombinant protein can be expressed and isolated, it is limited to the 
production of enzymes in cotton fibers of the cotton plant. 

Furthermore, a major hurdle encountered when expressing cellulose 
targeted proteins within a plant is the interference of the expressed products 
in the natural formation of the cell wall, which typically results in growth 
arrest of the plant growth. Although this hurdle can be overcome by, for 
example, targeting the protein to specific plant tissue as is the case for U.S. 
Pat. No. 5,474,925, this targeting severely limits the expressing biomass 
and as such the quantity of the expressed protein. In addition, targeting the 
expression to a specific plant tissue also limits the number of plant species 
which can be effectively utilized for such an expression. 

There is thus a widely recognized need for, and it would be highly 
advantageous to have, a plant expression system and method which provide 
high level of expression of a recombinant protein and which allow simple 
and effective recovery of the expressed recombinant protein devoid of the 
above limitations. 



10 



SUMMARY O F TH E I NVENTION 

According to one aspect of the present invention there is provided a 
process of expressing a recombinant protein in a plant and of isolating the 
recombinant protein from the plant, the process comprising the steps of (a) 
providing a plant, a plant derived tissue or cultured plant cells expressing a 
fusion protein including the recombinant protein and a cellulose binding 
peptide being fused thereto, the fusion protein being compartmentalized 
within cells of the plant, plant derived tissue or cultured plant cells, so as to 
be sequestered from cell walls of the cells of the plant, plant derived tissue 
or cultured plant cells; (b) homogenizing the plant, plant derived tissue or 
cultured plant cells, so as to bring into contact the fusion protein with a 
cellulosic matter of the plant, plant derived tissue or cultured plant cells, to 
thereby effect affinity binding of the fusion protein via the cellulose binding 
peptide to the cellulosic matter, thereby obtaining a fusion protein cellulosic 
matter complex; and (c) isolating the fusion protein cellulosic matter 
complex. 

According to further features in preferred embodiments of the 
invention described below, the process further comprising the steps of 
washing the fusion protein cellulosic matter complex, thereby removing 
endogenous plant proteins and other plant material therefrom and collecting 
the fusion protein cellulosic matter complex as a final product of the 
process. 

According to still further features in the described preferred 
embodiments the process further comprising the steps of washing the fusion 
protein cellulosic matter complex, thereby removing endogenous plant 
proteins and other plant material therefrom, exposing the fusion protein 
cellulosic matter complex to conditions effective in dissociating the fusion 
protein from the cellulosic matter; and isolating the fusion protein, thereby 
obtaining an isolated fusion protein. 
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According to still further features in the described preferred 
embodiments the process further comprising the steps of exposing the 
isolated fusion protein to conditions effective in digesting the fusion protein 
so as to release the recombinant protein therefrom, thereby obtaining a 
5 released recombinant protein and isolating the released recombinant protein. 

According to still further features in the described preferred 
embodiments the process further comprising the steps of washing the fusion 
protein cellulosic matter complex, thereby removing endogenous plant 
proteins and other plant material therefrom, exposing the fusion protein 
10 cellulosic matter complex to conditions effective in digesting the fusion 
protein so as to release the recombinant protein therefrom, thereby 
obtaining a released recombinant protein, and isolating the released 
recombinant protein. 

According to still further features in the described preferred 
15 embodiments, the conditions effective in dissociating the fusion protein 
from the cellulosic matter are selected from the group consisting of basic 
conditions, denaturative conditions and affinity displacement conditions. 

According to still further features in the described preferred 
embodiments, the conditions effective in digesting the fusion protein so as 
20 to release the recombinant protein therefrom are selected from the group 
consisting of proteolysis effected via a protease and proteolysis effected 
under predetermined cis or trans conditions for digesting a controllable 
intervening protein sequence. 

According to another aspect of the present invention there is 
25 provided a genetically modified or viral infected plant or cultured plant cells 
expressing a fusion protein including a recombinant protein and a cellulose 
binding peptide 

According to further features in preferred embodiments of the 
invention described below, the fusion protein is compartmentalized within 
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cells of the plant or cultured plant cells, so as to be sequestered from cell 
walls of the cells of the plant or cultured plant cells. 

According to still further features in the described preferred 
embodiments the fusion protein is compartmentalized within a cellular 
compartment selected from the group consisting of cytoplasm, endoplasmic 
reticulum, golgi apparatus, oil bodies, starch bodies, chloroplastids, 
chloroplasts, chromoplastids, chromoplasts, vacuole, lysosomes, 
mitochondria, and nucleus. 

According to still further features in the described preferred 
embodiments expression of the fusion protein is under a control of a 
constitutive or tissue specific plant promoter. 

According to still further features in the described preferred 
embodiments the fusion protein includes a recombinant protein and a 
cellulose binding peptide separated therebetween via a unique amino acid 
sequence recognizable and digestible by a protease or under predetermined 
cis or trans conditions for digesting a controllable intervening protein 
sequence. 

According to yet another aspect of the present invention there is 
provided a composition of matter comprising (a) a plant derived cellulosic 
matter; and (b) a fusion protein including a recombinant protein and a 
cellulose binding peptide separated therebetween via a unique amino acid 
sequence recognizable and digestible by a protease or under predetermined 
cis or trans conditions for digesting a controllable intervening protein 
sequence, the fusion protein being complexed to the plant derived cellulosic 
matter by affinity binding via the cellulose binding peptide. 

According to still another aspect of the present invention there is 
provided a nucleic acid molecule comprising (a) a promoter sequence for 
directing protein expression in plant cells; (b) a heterologous nucleic acid 
sequence including (i) a first sequence encoding a cellulose binding peptide; 
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(ii) a second sequence encoding a recombinant protein, wherein the first and 
second sequences are joined together in frame; optionally (iii) a third 
sequence encoding a signal peptide for directing a protein to a cellular 
compartment, the third sequence being upstream and in frame with the first 
5 and second sequences; and/or optionally (iv) a fourth sequence encoding a 
unique amino acid sequence being recognizable and digestible by a protease 
or under predetermined cis or trans conditions for digesting a controllable 
intervening protein sequence, the fourth sequence being between and in 
frame with the first and second sequences, wherein, the heterologous 

10 nucleic acid sequence being down stream the promoter sequence, such that 
expression of the heterologous nucleic acid sequence is effectable by the 
promoter sequence. 

According to further features in preferred embodiments of the 
invention described below, the nucleic acid molecule further comprising a 

15 sequence element selected from the group consisting of an origin of 
replication for propagation in bacterial cells, at least one sequence element 
for integration into a plant's genome, a polyadenylation recognition 
sequence, a transcription termination signal, a sequence encoding a 
translation start site, a sequence encoding a translation stop site, plant RNA 

20 virus derived sequences, plant DNA virus derived sequences, tumor 
inducing (Ti) plasmid derived sequences, and a transposable element 
derived sequence. 

The present invention successfully addresses the shortcomings of the 
presently known configurations by providing a novel process of expressing 

25 and isolating recombinant proteins and recombinant protein products from 
plants, plant derived tissue or cultured plant cells, which process exploits (i) 
the high affinity between cellulose binding peptides and cellulose; (ii) the 
inherent abundance of cellulose in planta; and (iii) the simplicity associated 
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with cellulose isolation from plants, plant derived tissue and/or cultured 
plant cells. 

5 

BRIEF DESC R IPTIO N O F TH E DRAWI NGS 

The invention is herein described, by way of example only, with 
reference to the accompanying drawings, wherein: 
10 FIG. 1 is a process flow chart demonstrating the process according 

to the present invention. 



15 D E SCRIP T ION OF THE PREFERRED E MBODIMENTS 

The present invention is of a process which can be used for 
expressing and isolating recombinant proteins and recombinant protein 
products from plants, plant derived tissue or cultured plant cells. The 
present invention is further of nucleic acid molecules and genetically 

20 modified or viral infected plants or plant cells which are useful while 
implementing the process, and further of a novel composition of matter 
which results from the process. Specifically, the present invention can be 
used to obtain large quantities of the recombinant proteins and the 
recombinant protein products in a simple and cost effective manner, since 

25 the process according to the present invention exploits (i) the high affinity 
between cellulose binding peptides and cellulose; (ii) the inherent 
abundance of cellulose in planta; and (iii) the simplicity associated with 
cellulose isolation from plants, plant derived tissue and/or cultured plant 
cells. 
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The principles and operation of a process according to the present 
invention may be better understood with reference to the drawings and 
accompanying descriptions. 

Before explaining at least one embodiment of the invention in detail, 
it is to be understood that the invention is not limited in its application to the 
details of construction and the arrangement of the components set forth in 
the following description or illustrated in the drawings. The invention is 
capable of other embodiments or of being practiced or carried out in various 
ways. Also, it is to be understood that the phraseology and terminology 
employed herein is for the purpose of description and should not be 
regarded as limiting. 

Process: 

Referring now to the drawings, Figure 1 is a flow chart illustrating 
the process according to the teachings of the present invention. 

The process according to the present invention is directed at 
expressing a recombinant protein in a plant and further at isolating the 
recombinant protein from the plant. The process according to the present 
invention is effected by first providing a plant, a plant derived tissue or 
cultured plant cells (which are referred to herein below individually and 
collectively as "plant material") 10 expressing a fusion protein which 
includes the recombinant protein and a cellulose binding peptide fused 
thereto. The fusion protein is compartmentalized within cells of the plant 
material, so as to be sequestered from cell walls of the cells of the plant 
material. As used herein in the specification and in the claims section that 
follows, the phrase "cultured plant cells" includes both non-differentiated 
plant cell cultures and some what more differentiated callus cultures. 

Compartmentalizing and thereby sequestering the fusion protein 
from the cell walls is an essential feature of the present invention because 
high levels of exspressed cellulose binding peptide associated with plant 
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cell walls inhibit plant growth to a great extent. See to this effect U.S. Pat. 
applications Nos. 09/006,632; 09/006,636; and PCT/IL9 8/0034 5 (WO 
99/07830), 

When sufficient expression has been detected by sampling and 
5 testing the plant material as further detailed hereinunder, the plant material 
is homogenized 12 so as to bring into contact the fusion protein with a 
cellulosic matter of the plant material, to thereby effect affinity binding 14 
of the fusion protein via the cellulose binding peptide to the cellulosic 
matter, thereby obtaining a fusion protein cellulosic matter complex. 
13 10 Conditions such as, but not limited to, temperature, pH, salt concentration, 
Ui time and the like are preferably set so as to allow maximal binding. Such 

i : y 

q conditions are well know to the skilled artisan and can be experimentally 

m 

\Xl modified to best suit a specific application. Sampling and testing can be 

employed to monitor the binding process, as further detailed hereinunder. 

?n 15 When sufficient binding has occurred the fusion protein cellulosic 

matter complex is collected or isolated 16 by methods well known to the 

^ skilled artisan which method are traditionally employed for isolation of 

cellulosic matter from plant material. Thereafter, a wash step 18 is 
employed to remove unbound material, including, in particular, unbound 
20 endogenous plant proteins, thereby isolating the fusion protein cellulosic 
matter complex. The wash step can be repeated one or several times with a 
single or several wash solutions, each of which can include in addition to 
water, buffers, salts, detergents and the like to efficiently effect the removal 
of unbound matter from the fusion protein cellulosic matter complex. The 
25 wash step can be effected in solution using appropriate stiring, however, 
advantageously, the wash step is effected within a column into which the 
collected or isolated fusion protein cellulosic matter complex is packed and 
subsequently washed. 
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According to one embodiment of the present invention, and as 
indicated in Figure 1 by numeral 20, the fusion protein cellulosic matter 
complex is collected as a final product of the process. Such a final product 
can serve as a pack for affinity columns. In this case the recombinant 
protein is selected to have affinity to a ligand, which can then be affinity 
purified via a column packed with the fusion protein cellulosic matter 
complex, in a manner otherwise similar to that described in U.S. Pat. No. 
5,474,925, which is incorporated herein by reference. One of the 
advantages of the process described herein over the teachings of U.S. Pat. 
No. 5,474,925 is that by sequestering the fusion protein from the cell walls 
one can achieve very high expression of the fusion protein as compared to 
the low expression levels practically enabled by U.S. Pat. No. 5,474,925, 
because no deleterious effect on plant growth is exerted. As a result, the 
specific activity of the fusion protein cellulosic matter complex formed 
according to the present invention, i.e., the number of fusion protein 
molecules per weight of cellulosic matter, is far superior. Further details 
relating to the effect of high cellulose binding peptide expression on plant 
development see also PCT/IL9 8/000345. 

According to another embodiment of the present invention, as 
indicated by numeral 22, the final product of the process according to the 
present invention is the fusion protein itself 24. Thus, according to this 
embodiment of the present invention, conditions effective in dissociating 
the fusion protein from the cellulosic matter are used to effect such 
dissociation. The dissociated fusion protein is thereafter readily isolated by 
any conventional separation technique, such as, but not limited to, elution or 
size separation, such as differential filtration or centrifugation, thereby 
obtaining an isolated fusion protein. Conditions effective in dissociating the 
fusion protein from the cellulosic matter include, but are not limited to, 
basic conditions (e.g., 20 mM Tris pH 12) which are known to dissociate all 
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cellulose binding peptides from cellulose, denaturative conditions, or 
affinity displacement conditions, e.g., using 200 nM glucose or cellobiose 
which are know in their ability to elute family IX cellulose binding domains 
(CBDs). Alternatively, a protein cleavage site can be inserted in the 
5 cellulose binding peptide to facilitate the dissociation of the fusion protein 
by specific proteolysis, for example. See to this effect and to other uses of 
CBD-fusion proteins U.S. Pat. Nos. 5,719,044; 5,670,623; 5,856,021; 
5,137,819; 5,202,247; 5,340,731; and 5,474,925; and U.S. Pat. applications 
Nos. 08/788,621; and 08/788,622; EP 0 381 719 Bl, and EP application No. 

U 10 93907724.4. See also the teachings of U.S. Pat. No. 5,834,247, which is 

l.j further described hereinunder. 
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As indicated by numeral 26, the fusion protein thus isolated can be 
exposed to conditions effective in digesting the fusion protein so as to 
release the recombinant protein therefrom, thereby obtaining a released 

15 recombinant protein which can be thereafter isolated as a final product 28. 
Conditions effective in digesting the fusion protein so as to release the 
recombinant protein therefrom include, but are not limited to, proteolysis 
effected via a protease, such as, but not limited to, Factor Xa, enterokinase, 
thrombin, trypsin, papain, pepsin, chemotrypsin and the like, or proteolysis 

20 effected via controllable intervening protein sequence (CIVPS) inserted into 
or adjacent the cellulose binding peptide, the CIVPS are capable of excision 
from or cleavage of the peptide under predetermined conditions in cis or in 
trans, e.g., increase in temperature, exposure to light, unblocking of amino 
acid residues by dephosphorylation and treatment with chemical reagents or 

25 deglycosylation, examples include proteolysis effected under acidic 
conditions (HQ, e.g., to cleave between Asp and Pro) and proteolysis 
effected by a proteolysing reagent, such as CNBr to cleave downstream of 
Met, all as known in the art and/or as further described in U.S. Pat. No. 
5,834,247, which is incorporated herein by reference. 
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Isolating final product 28 from other proteolytic products derived, for 
example, from the cellulose binding peptide, can be effected by any one of a 
number of protein isolation techniques well known to the skilled artisan, 
including, but not limited to, affinity separation via, for example, antibodies 
5 bound to a solid support, size and/or charge based separation via gel 
electrophoresis or chromatography, and the like. Additional methods 
include, but are not limited to, fractionation, gel-filtration, ion-exchange, 
hydrophobic, and affinity chromatography, ultrafiltration and 
crystallization. 

10 According to an alternative embodiment of the process of the present 

invention, as indicated in Figure 1 by numeral 30, the washed fusion protein 
cellulose matter complex resulting from step 18 is exposed to conditions 
effective in digesting the fusion protein so as to release the recombinant 
protein therefrom, thereby obtaining a released recombinant protein. These 

15 conditions are similar to those described with respect to step 26. The 
released recombinant protein is thereafter readily isolated by any 
conventional separation technique, such as, but not limited to, displacement 
or size separation, such as differential filtration or centrifugation, thereby 
obtaining an isolated recombinant protein final product 32. 

20 Cellulose binding peptides: 

As used herein in the specification and in the claims section below, 
the phrase "cellulose binding peptide" includes peptides e.g., proteins and 
domains (portions) thereof, which are capable of, when expressed in plant 
cells, affinity binding to a plant derived cellulosic matter following 

25 homogenization and cell rupture. The phrase thus includes, for example, 
peptides which were screened for their cellulose binding activity out of a 
library, such as a peptide library or a DNA library (e.g., a cDNA 
expression library or a display library) and the genes encoding such 
peptides isolated and are expressible in plants. Yet, the phrase also 
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includes peptides designed and engineered to be capable of binding to 
cellulose and/or units thereof. 

Such peptides include amino acid sequences expressible in plants 
that are originally derived from a cellulose binding region of, e.g., a 

5 cellulose binding protein (CBP) or a cellulose binding domain (CBD). 
The cellulose binding peptide according to the present invention can 
include any amino acid sequence expressible in plants which binds to a 
cellulose polymer. For example, the cellulose binding domain or protein 
can be derived from a cellulase, a binding domain of a cellulose binding 

10 protein or a protein screened for, and isolated from, a peptide library, or a 
protein designed and engineered to be capable of binding to cellulose or to 
saccharide units thereof, and which is expressible in plants. The cellulose 
binding domain or protein can be naturally occurring or synthetic, as long 
as it is expressible in plants. Suitable polysaccharidases from which a 

15 cellulose binding domain or protein expressible in plants may be obtained 
include |3-l,4-glucanases. In a preferred embodiment, a cellulose binding 
domain or protein from a cellulase or scaffoldin is used. Typically, the 
amino acid sequence of the cellulose binding peptide expressed in plants 
according to the present invention is essentially lacking in the hydrolytic 

20 activity of a polysaccharidase (e.g., cellulase, chitinase), but retains the 
cellulose binding activity. The amino acid sequence preferably has less 
than about 10 % of the hydrolytic activity of the native polysaccharidase; 
more preferably less than about 5 % , and most preferably less than about 
1 % of the hydrolytic activity of the native polysaccharidase, ideally no 

25 activity altogether. 

The cellulose binding domain or protein can be obtained from a 
variety of sources, including enzymes and other proteins which bind to 
cellulose which find use in the subject invention. 
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In Table 4 below are listed those binding domains which bind to 
one or more soluble/insoluble polysaccharides including all binding 
domains with affinity for soluble glucans (a, p, and/or mixed linkages). 
The Nl cellulose-binding domain from endoglucanase CenC of C, firm is 
the only protein known to bind soluble cellosaccharides and one of a small 
set of proteins which are known to bind any soluble polysaccharides. 
Also, listed in Tables 1 to 3 are examples of proteins containing putative (3 
-1,3-glucan-binding domains (Table 1); proteins containing Streptococcal 
glucan-binding repeats (Cpl superfamily) (Table 2); and enzymes with 
chitin-binding domains, which may also bind cellulose (Table 3). The 
genes encoding each one of the peptides listed in Tables 1-4 are either 
isolated or can be isolated as further detailed hereinunder, and therefore, 
such peptides are expressible in plants. Scaffoldin proteins or portions 
thereof, which include a cellulose binding domain, such as that produced 
by Clostridium cellulovorans (Shoseyov et al, PCT/US94/04132) can also 
be used as the cellulose binding peptide expressible in plants according to 
the present invention. Several fungi, including Trichoderma species and 
others, also produce polysaccharidases from which polysaccharide binding 
domains or proteins expressible in plants can be isolated. Additional 
examples can be found in, for example, Microbial Hydrolysis of 
Polysaccharides, R. A. J. Warren, Annu. Rev. Microbiol. 1996, 
50:183-212; and "Advances in Microbial Physiology" R. K. Poole, Ed., 
1995, Academic Press Limited, both are incorporated by reference as if 
fully set forth herein. 
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Table 1 

Overview of proteins containing putative fi-1,3 glucan-binding domains 



Source (strain) 



Type I 



Protein 



accession No. 



Ref 1 



B. circulans (WL-12) GLCA1 

B. circulans (IAM 1 165) BglH 

Type II 

Actinomadura sp. (FC7) Xynll 

Arthrobacter sp. (YCWD3) GLCI 

O. xanthineolytica GLC 
R. faecitabidus (YLM-50) 

R. communis Ricin 

S. Uvidans (1326) XlnA 

T. trident at us FactorGa 



RP I 



P23903/M34503/JQ0420 1 
JN0772/D17519/S67033 2 



U08894 3 
D23668 9 
P22222/M60826/A39094 4 

Q05308/A45053/D 10753 5a,b 
A 12892 6 
P26514/M64551/JS07986 7 
D 16622 8 



B. : Bacillus, O. : Oerskovia, R. faecitabidus : Rarobacier faecitabidus, R. communis: Ricinus 
communis, 5. : Streptomyces, T. : Tachypleus (Horseshoe Crab) 

^References: 

1) Yahata et al. (1990) Gene 86, 1 13-1 17 

2) Yamamoto et al (1993) Biosci. Biotechnol. Biochem. 57, 1518-1525 

3) Harpin et al ( 1994) EMBL Data Library 

4) Shene/a/. (1991)7. Biol Chem. 266,1058-1063 

5a) Shimoi et al. (1992)7. Biol Chem, 267,25189-25195 
5b) Shimoi et al (1992) J. Biochem 110, 608-613 

6) Horn et al (1989) Patent A 12892 

7) Shareck et al (1991) Gene 107, 75-82 

8) Sekiera/. (1994)/. Biol Chem. 269,1370-1374 

9) Watanabe et al (1993) EMBL Data Library 
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Table 2 

Overview of proteins containing Streptococcal glucan-binding repeats (Cpl 

superfamily) 



Source 



Protein 



Accession No. 



Ref. 





S. 


downei (sobrinus) (0MZ176) 


GTF-I 


D13858 1 


45 


S. 


downei (sobrinus) (MFe28) 


GTF-1 


P11001/M17391 2 




s. 


downei (sobrinus) (MFe28) 


GTF-S 


P29336/M30943/A41483 3 




s. 


downei (sobrinus) (6715) 


GTF-I 


P27470/D90216/A38175 4 




s. 


downei (sobrinus) 


DEI 


L34406 5 


50 


s. 


mutants (Ingbritt) 


GBP 


M30945/A37184 6 




s. 


mutants (GS-5) 


GTF-B 


A33128 7 




s. 


mutants (GS-5) 


GTF-B 


P08987/M17361/B33135 8 




s. 


mutants 


GXF _ B 3'-ORF 


P05427/C33135 8 
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• 


23 






S. mutants (GS-5) 


GTF-C 


P 1 3470/M 1 73 6 1 /M22054 


9 


0, mill (HI IS \\jj~J) 




ItDl dvdaldUlC 


m 
i\j 


J. fflUlOJllS \\Ji>~D) 


r;TF n 




1 1 

1 1 


$ C/7 / 1 v/i riuK 


GTF-J 


A4481 1/S22726/S28809 


12 






Z11873/M64111 




5. salivarius 


GTF-K 


S22737/S22727/ZU872 


13 


S salivarius fATCC25975'i 


GTF-L 


L35495 


14 


S. salivarius (ATCC25975) 


GTF-M 


L35928 


14 


lj. ufiz-UJiiuiiiUL rvvj 


I vtA 


P06653 / A25634 /M 1 38 1 2 


15 


o. fsricuiiiufHUc 


Ptn A 


A41971/M74122 


16 


Phage HB-3 


HBL 


P32762/M34652 


17 


Phage Cp-1 


CPL-1 


P15057/J03586/A31086 


18 


Pha^e Cn-9 


CPl -9 


P 1 93 86/M347 80/JO043 8 


19 


Phaue FJ-1 


FJI 


A42936 


20 




Toy A 


PI 61 54/A37052/M30307 


21 










C. difficile (BARTS Wl) 


ToxA 


A60991/X17194 


22 


C. dijflicife (VPI 10463) 


ToxB 


P18177/X53138/X60984 


23,24 










C tfigficite (1470) 


ToxB 


S44271/Z23277 


25,26 


C novyi 


a- toxin 




£ / 


o . nuvyi 


3~tOXUl 




?R 
£o 


C. acetobutylicum (NCIB8052) 


CspA 


S49255/Z37723 


29 


C. acetobutylicum (NCIB8052) 


CspB 


Z50008 


30 


C acetobutylicum (NCIB8052) 


CspC 


Z50033 


30 


C. acetobutylicum (NCIB8052) 


CspD 


Z50009 


30 



35 -^References: 

1) Sato et al (1993) DNA sequence 4, 19-27 

2) Ferreti et al (1987)/, Bacteriol. 169,42714278 

3) Gilmore et al. (1990)7. Infect. Immun. 58,2452-2458 
40 4)Aboerai. (1991)7. Bacteriol. 173,989-996 

5) Sun etal (1994)7. Bacteriol. 176,7213-7222 

6) Banase/a*. (1990)7. Infect. Immun. 58,667-673 

7) Shiroza et al. (1990) Protein Sequence Database 

8) Shiroza et al. {mi) J. Bacteriol. 169,4263-4270 
45 9) Ueda et al. (1988) Gene 69, 101-109 

10) Russel(1990)Xrc/i. Oral, Biol. 35,53-58 

11) Hondas al. (1990)7. Gen. Microbiol. 136,2099-2105 

12) Giffardera/. (1991)7. Gen. Microbiol. 137,2577-2593 

13) Jacques (1992) EMBL Data Library 

50 14) Simpson et al. (1995)7. Infect. Immun. 63,609-621 

15) Gargia et al. (1986) Gene 43, 265-272 

16) Yother ex al (1992)7. BacterioL 174,601-609 

17) Romero et al (1990)7. Bacteriol. 172, 5064-5070 

18) Garcia et al. (1988) Proc. Natl. Acad. Sci, USA 85, 914-918 
55 19) Garcia et al. (1990) Gene 86, 81-88 

20) Diaz etal. (1992)7. Bacteriol. 174,5516-5525 

21) Dove etal. (1990)7. Infect. Immun. 58,480-488 

22) Wren et al. (1990) FEMS Microbiol. Lett. 70, 1-6 
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23) Barroso et a. (1990) Nucleic Acids Res. 18, 4004-4004 

24) von Eichel-Streiber et at. (1992) Mot. Gen. Genet. 233,260-268 

25) Sartinger et al (1993) EMBL Data Library 

26) von Eichel-Streiber et al (1995) Mol. Microbiol In Press 
5 27) Hofmann et al. (1993) EMBL Data Library 

28) Hofinann et al (1995) Mol Gen. Genet. In Press 

29) Sanchez et al. (1994) EMBL Data Library 

30) Sanchez et al. (1995) EMBL Data Library 

New cellulose binding peptides with interesting binding 

10 characteristics and specificities can be identified and screened for and the 
genes encoding same isolated using well known molecular biology 
approaches combined with a variety of other procedures including, for 
example, spectroscopic (titration) methods such as: NMR spectroscopy 
(Zhu et al Biochemistry (1995) 34:13196-13202, Gehring et al 

15 Biochemistry (1991) 30:5524-5531), UV difference spectroscopy 
(Belshaw et al Eur. J. Biochem. (1993) 211:717-724), fluorescence 
(titration) spectroscopy (Miller et al J. Biol Chem. (1983) 258:13665- 
13672), UV or fluorescence stopped flow analysis (De Boeck et al. Eur. 
J. Biochem. (1985) 149:141-415), affinity methods such as affinity 

20 electrophoresis (Mimura et al J. chromatography (1992) 597:345-350) 
or affinity chromatography on immobilized mono or oligosaccharides, 
precipitation or agglutination analysis including turbidimetric or 
nephelometric analysis (Knibbs et al. J. BioL Chem. (1993) 14940- 
14947), competitive inhibition assays (with or without quantitative IC50 

25 determination) and various physical or physico-chemical methods 
including differential scanning or isothermal titration calorimetry 
(Sigurskjold et al J. Biol Chem. (1992) 267:8371-8376; Sigurskjold et 
al Eur. J. Biol. (1994) 225:133-141) or comparative protein stability 
assays (melts) in the absence or presence of oligo saccharides using 

30 thermal CD or fluorescence spectroscopy. 

The K a for binding of the cellulose binding domains or proteins to 
cellulose is at least in the range of weak antibody-antigen extractions, i.e., 
> 10 3 , preferably 10 4 , most preferably 10 6 Mr 1 . If the binding of the 
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cellulose binding domain or protein to cellulose is exothermic or 
endothermic, then binding will increase or decrease, respectively, at lower 
temperatures, providing a means for temperature modulation of the 
binding step, see numeral 14 in Figure L 



10 



15 



Table 3 

Overview of enzymes with chitin-binding domains 



Source (strain) 



Enzyme 



Accession No. 



Ref. 3 



Bacteria) enzymes 
Type 1 





Aeromonas sp* (NolOS-24) 


Chi 


D31818 


1 




Bacillus circulans (WL-12) 


Chi A I 


P20S33/M57601/A38368 


2 




Bacillus circulans (WL-12) 


ChiD 


P27050/D10594 


3 




Janthinobacterium lividum 


Chi69 


U07025 


4 


20 


Streptomyces griseus 


Protease C 


A53669 


5 




Type R 










Aeromonas cavia (Kl) 


Chi 


U09139 


6 


25 


AUeromonas sp (0-7) 


Chi85 


A40633/P32823/D13762 


7 




Autographa catifomica (C6) 


NPH-128 a 


P41684/L22858 


8 




Serratia marcescens 


ChiA 


A25090/X03657/LG1455/P07254 


9 




iype in 








30 












Rhizopus oligosporus (IF08631) 


Chil 


P29026/A47022/D 1 01 57/S274 1 8 


10 




Rhizopus oligosporus (IF08631) 


Chi2 


P29027/B47022/D10158/S27419 


10 




Saccharomyces cerevisiae 


Chi 


S5037WU17243 


n 


35 


Saccharomyces cerevisiae 


Chil 


P29028/M74069 


12 




(DBY939) 










Saccharomyces cerevisiae 


Chi2 


P29029/M7407/B41O35 


12 




(DBY918) 








40 


Plant enzymes 










Hevein superfamily 










Allium sativum 


Chi 


M94105 


13 


45 


Amaranthus caudatus 


AMP-l b 


P27275/A40240 


14, 




AmararUhus caudatus 


AMP-2 b 


S37381/A40240 


14, 




Arabidopsis thatiana 


ChiB 


P19171/M38240/B455I1 


16 




(cv. Colombia) 










Arabidopsis thaliana 


PHpc 


U01880 


17 


50 


Brassica napus 


Chi 


U21848 


18 




Brassica napus 


Chi2 


Q09023/M95835 


19 




Hevea brasiliensis 


Hevl d 


P02877/M36986/A03770/A38288 20 
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Hordeum vulgare Chi33 

tycopersicon esculentum Chi9 

Nkotiana tabacum CBP20 e 

Nkotiana tabacum Chi 

Nkotiana tabacum (cv. Havana) Chi 

Nkotiana tabacum (FB7-1) CM 

Nicotiona tabacum (cv. Samsun) Chi 

Nicotiona tabacum (cv. Havana) Chi 

Nkotiana tabacum (cv. BY4) Chi 

Nicotiona tabacum (cv. Havana) Chi 

Oryza sativum (IR36) ChiA 

Oryza sativum ChiB 

Oryza sativum Chi 

Oryza sativum (cv . Japonicum) Chi 

Oryza sativum (cv. Japonicum) Chil 

Oryza sativum Chi2 

O/yza sativum (cv, Japonicum) Chi3 

O/yza sativum ChiA 

Oryza sativum Chil 

Oryza ra/iiwn (IR58) Chi 

Oryza sativum Chi 

Pisum sativum (cv. Birte) Chi 

Pi'jwm sativum (cv, Alcan) Chi2 

Populus trichocarpa Chi 

Populus trichocarpa (H 11 - 1 1 ) Chi 

Phaseolus vulgaris fcv. Saxa) Chi 

Phaseolus vulgaris (cv . Saxa) Chi 

Sambucus nigra PR-3 f 

Seca/e cereale Chi 

Solanum tuberosum ChiBl 

Solanum tuberosum ChiB2 

Solanum tuberosum ChiB3 

Solanum tuberosum ChiB4 

Solanum tuberosum WIN- IS 
(cv. Maris Piper) 

Solanum tuberosum WIN-2S 
(cv. Maris Piper) 

Triticum aestivum Chi 

Triticum aestivum WGA- lfe 

Triticum aestivum WGA~2 n 

Triticum aestivum WGA-3 

Utmus americano Chi 

{/rftaj rfkHca AGL* 

tfgmz unguiculata Chil 
(cv. Red caioona) 



L34211 22 

Q05538/Z15140/S37344 23 

S72424 24 

A21091 25 

A29074/M15173/S20981/S19855 26 

JQ0993/S0828 27 

A16I19 28 

P08252/X16939/S08627 27 
P24091/XS1599/X64519//S13322 26,27,29 

P29059/X64518/S20982 26 

L37289 30 

JC2253/S42829/Z29962 31 

S39979/S40414/X56787 32 

X56063 33 

P24626/X54367/S 14948 34 

P25765/S15997 35 
D 16223 

JC2252/S42828 30 

D16221 32 

U02286 36 

X87109 37 

P36907/X63899 38 

L37876 39 

S18750/S18751/X59995/P29032 40 

U01660 41 

A24215/S43926/Jq0965/P36361 42 
P062 1 5/M 1 3968/M 1 9052/ A25898 43 ,44 ,45 

Z46948 46 

JC2071 47 

U026Q5 48 

U02606 48 

U02607/S43317 48 

U02608 48 

P0976 1 /X 1 3497/S04926 49 

P09762/X 1 3497/S04927 49 

S38670/X76041 50 

P10968/M25536/S09623/S07289 51,52 

P02876/M25537/S09624 5 1 ,53 

P 1 0969/J0296 1 IS 1 0045/ A2840 1 54 

L22032 55 

M87302 56 

X88800 57 
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a NHP ; nuclear polyhedrosis virus endochitinase like sequence; Chi : chitinase, ^anli.-microbial 
peptide, c pre-hevein like protein, d hevein, e chitin-binding protein, ^pathogenesis related protein, 
£ wound-induced protein, ^wheat genri agglutinin, ^agglutinin (lectin). 
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55) Hajela et al. (1993) EMBL Data Library 

50 56) Lemer et al (1992)/. Biol Chem. 267,11085-11091 
57) Vo et al (1995) EMBL Data library 
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Table 4 

Sources of polysaccharide binding domains 



Binding Domain 



Proteins Where Binding 
Domain is Found 



10 



15 



Q 20 

ru 



Cellulose Binding 
Domains* 



25 



30 



35 



40 



45 



Starcn/Maltodextrin 
Binding Domains 



Dextran Binding Domains 



p-Glucan Binding Domains 



Chitin Binding Domains 



p-glucanases (avicelases, CM Cases, 
cellodextrinases) 

exoglucanses or cellobiohydrolases 
cellulose binding proteins 
xylanases 

mixed xylanases/glucanases 

esterases 

chitinases 

P'l,3-glncanases 

(3- 1 ,3-(p- 1 ,4)-glucanases 

((J-)mannana$es 

p-glucosidases/galactosidases 

cellulose synthases (unconfirmed) 

a-amylases2»3 
p-amylases^ 
pullulanases 
glucoamylases^' 7 
cyclodextrin glucotransferases^"^ 
(cyclomaltodextrin glucanotransf erases) 
maltodextrin binding proteins 1 1 

(Streptococcal) glycosyl transferases^ 
dextran sucrases (unconfirmed) 
Clostridial toxins 1 3 > 14 
glucoamylases^ 
dextran binding proteins 

p-l,3-glucanases 15 ' 16 
p-l,3-(fM,4)-glucanases (unconfirmed) 
p-l,3-glucan binding protein 

chitinases 
cbitobiases 

chitin binding proteins 

(see also cellulose binding domains) 

Heivein 



50 



55 



Wilkes et al.. Adv. Microbiol Reviews, (1991) 303-315. 
2 S?gaarde*a/., /. Biol. Chem. (1993)268:22480. 
3 WeseIake et al, Cereal Chem. (1983) 60:98. 
4 Svensson et al., J. (1989) 264:309, 
5 Jespersen et al., J. (1991) 280:51. 
6 Belshaw et al, Bur. J. Biochem. (1993)211:717. 
7 Sigurskjoldef<tf., Eur. J. Biochem. (1994) 225:133. 
8 VilIette et al, Biotechnol Appl Biochem. (1992) 16:57. 
9 Fukada#a/.. BioscL Biotechnol Biochem. (1992) 56:556. 
10 Lawson et al, J. Mol Biol (1994)236:590. 
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14 vonEichel-Streiberera^ Mol Gen. Genet. (1992)233:260. 
15 Klebl et ah. J. BacterioL (1989) 171:6259. 
16 Watanabeer a/., J. BacterioL (1992) 174:186. 
17 Duvic Biol. Chem. (1990) :9327. 

5 Thus, and as already stated, the phrase "polysaccharide binding 

peptide" includes an amino acid sequence which comprises at least a 

functional portion of a polysaccharide binding region (domain) of a 

polysaccharidase or a polysaccharide binding protein. The phrase further 

relates to a polypeptide screened for its cellulose binding activity out of a 

io library, such as a peptide library or a DNA library (e.g., a cDNA library 
or a display library). By "functional portion" is intended an amino acid 
sequence which binds to cellulose. 

The techniques used in isolating polysaccharidase genes, such as 
cellulase genes, and genes for cellulose binding proteins are known in the 

15 art, including synthesis, isolation from genomic DNA, preparation from 
cDNA, or combinations thereof. (See, U.S. Pat. Nos. 5,137,819; 
5,202,247; 5,340,731; 5,496,934; and 5,837,814). The sequences for 
several binding domains, which bind to soluble oligosaccharides are 
known (See, Figure 1 of PCT/CA97/00033 , WO 97/26358). The DNAs 

20 coding for a variety of polysaccharidases and polysaccharide binding 
proteins are also known. Various techniques for manipulation of genes 
are well known, and include restriction, digestion, resection, ligation, in 
vitro mutagenesis, primer repair, employing linkers and adapters, and the 
like (see Sambrook et aL, Molecular Cloning - A Laboratory Manual, 

25 Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989, 
which is incorporated herein by reference). 

The amino acid sequence of a polysaccharidase can be used to 
design a probe to screen a cDNA or a genomic library prepared from 
mRNA or DNA from cells of interest as donor cells for a 

30 polysaccharidase gene or a polysaccharide binding protein gene. By using 
the polysaccharidase cDNA or binding protein cDNA or a fragment 
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thereof as a hybridization probe, structurally related genes found in other 
species can be easily cloned and provide a cellulose binding peptide which 
is expressible in plants according to the present invention. Particularly 
contemplated is the isolation of genes from organisms that express 
polysaccharidase activity using oligonucleotide probes based on the 
nucleotide sequences of genes obtainable from an organism wherein the 
catalytic and binding domains of the polysaccharidase are discrete, 
although other polysaccharide binding proteins also can be used (see, for 
example, Shoseyov, el ah, Proc. Nat'l. Acad. Sci. (USA) (1992) 
89:3483-3487), 

Probes developed using consensus sequences for the binding 
domain of a polysaccharidase or polysaccharide-binding protein are of 
particular interest. The (3-1,4-gIycanases from C. fimi characterized to 
date are endoglucanases A, B, C and D (CenA, CenB, CenC and CenD, 
respectively), exocellobiohydrolases A and B (CbhA and CbhB, 
respectively), and xylanases A and D (Cex and XylD, respectively) (see 
Wong et al. (1986) Gene, 44:315; Meinke et al (1991) J. Bacteriol., 
173:308; Coutinho et aL y (1991) MoL Microbiol. 5:1221; Meinke et aL y 
(1993) Bacteriol., 175:1910; Meinke et aL, (1994) MoL Microbiol., 
12:413; Shen et al., Biochem. J., in press; O'Neill et aL, (1986) Gene, 
44:325; and Millward-Sadler et al., (1994) Mol. Microbiol., 11:375). 
All are modular proteins of varying degrees of complexity, but with two 
features in common: a catalytic domain (CD) and a cellulose-binding 
domain (CBD) which can function independently (see Millward-Sadler et 
ah, (1994) Mol. Microbiol., 11:375; Gilkes et al., (1988) J. Biol. 
Chem., 263:10401; Meinke et aL y (1991) J. Bacteriol., 173:7126; and 
Coutinho et al., (1992) Mol, Microbiol., 6:1242). In four of the 
enzymes, CenB, CenD, CbhA and CbhB, fibronectin type III (Fn3) 
repeats separate the N-terminal CD from the C-terminal CBD. The CDs 
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of the enzymes come from six of the families of glycoside hydrolases (see 
Henrissat (1991) Biochem. J., 280:309; and Henrissat et aL, (1993) 
Biochem. J., 293:781); all of the enzymes have an N- or C-terminal CBD 
or CBDs (see Tomme et aL , Adv. Microb. Physiol ., in press); CenC has 
5 tandem CBDs from family IV at its N-terminus; CenB and XylD each 
have a second, internal CBD from families III and II, respectively. Cex 
and XylD are clearly xylanases; however, Cex, hut not XylD, has low 
activity on cellulose. Nonetheless, like several other bacterial xylanases 
(see Gilbert et aL, (1993) J. Gen. Microbiol., 139:187), they have 

10 CBDs. C. firm probably produces other |3-l,4-glycanases. Similar 
systems are produced by related bacteria (see Wilson (1992) Crit. Rev. 
Biotechnol., 12:45; and Hazlewood et aL y (1992) J. Appl. BacterioL, 
72:244). Unrelated bacteria also produce glycanases; Clostridium 
thermocellum, for example, produces twenty or more P-l ,4-glycanases 

15 (see Beguin et aL, (1992) FEMS Microbiol. Lett., 100:523). The CBD 
derived from C. fimi endoglucanase C Nl, is the only protein known to 
bind soluble* cellosaccharides and one of a small set of proteins that are 
known to bind any soluble polysaccharides. 

Examples of suitable binding domains are shown in Figure 1 of 

20 PCT/CA97/00033 (WO 97/26358), which presents an alignment of 
binding domains from various enzymes that bind to polysaccharides and 
identifies amino acid residues that are conserved among most or all of the 
enzymes. This information can be used to derive a suitable 
oligonucleotide probe using methods known to those of skill in the art. 

25 The probes can be considerably shorter than the entire sequence but 
should at least be 10, preferably at least 14, nucleotides in length. Longer 
oligonucleotides are useful, up to the full legnth of the gene, preferably no 
more than 500, more preferably no more than 250, nucleotides in length. 
RNA or DNA probes can be used. In use, the probes are typically labeled 
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in a detectable manner, for example, with 32p 4 3h, biotin, avidin or other 
detectable reagents, and are incubated with single-stranded DNA or RNA 
from the organism in which a gene is being sought. Hybridization is 
detected by means of the label after the unhybridized probe has been 

5 separated from the hybridized probe. The hybridized probe is typically 
immobilized on a solid matrix such as nitrocellulose paper. Hybridization 
techniques suitable for use with oligonucleotides are well known to those 
skilled in the art. Although probes are normally used with a detectable 
label that allows easy identification, unlabeled oligonucleotides are also 

10 useful, both as precursors of labeled probes and for use in methods that 
provide for direct detection of double-stranded DNA (or DNA/RNA). 
Accordingly, the term "oligonucleotide probe" refers to both labeled and 
unlabeled forms. 

Generally, the binding domains identified by probing nucleic acids 
15 from an organism of interest will show at least about 40 % identity 
(including as appropriate allowances for conservative substitutions , gaps 
for better alignment and the like) to the binding region or regions from 
which the probe was derived and will bind to a soluble p-1,4 glucan with 
a K a of > 1()3 M~l. More preferably, the binding domains will be at least 
20 about 60 % identical, and most preferably at least about 70 % identical to 
the binding region used to derive the probe. The percentage of identity 
will be greater among those amino acids that are conserved among 
polysaccharidase binding domains. Analyses of amino acid sequence 
comparisons can be performed using programs in PC/Gene 
25 (IntelliGenetics, Inc.). PCLUSTAL can be used for multiple sequence 
alignment and generation of phylogenetic trees. 

In order to isolate the polysaccharide binding protein or a 
polysaccharide binding domain from an enzyme or a cluster of enzymes 
that binds to a polysaccharide, several genetic approaches can be used. 
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One method uses restriction enzymes to remove a portion of the gene that 
codes for portions of the protein other than the binding portion thereof. 
The remaining gene fragments are fused with expression control 
sequences to obtain a mutated gene that encodes a truncated protein. 
Another method involves the use of exonucleases such as BaBl to 
systematically delete nucleotides either externally from the 5' and the 3 r 
ends of the DNA or internally from a restricted gap within the gene. 
These gene deletion methods result in a mutated gene encoding a 
shortened protein molecule which can then be evaluated for substrate or 
polysaccharide binding ability. 

Any cellulose binding protein or cellulose binding domain may be 
used in the present invention. The term "cellulose binding protein" ("CBP") 
refers to any protein or polypeptide which specifically binds to cellulose. 
The cellulose binding protein may or may not have cellulose or cellulolytic 
activity. The term "cellulose binding domain" ("CBD") refers to any 
protein or polypeptide which is a region or portion of a larger protein, said 
region or portion binds specifically to cellulose. The cellulose binding 
domain (CBD) may be a part or portion of a cellulase, xylanase or other 
polysaccharidase, e.g., a chitinase, etc., a sugar binding protein such as 
maltose binding protein, or scaffoldin such as CbpA of Clostridium 
celluvorans, etc. Many cellulases and hemicellulases (e.g. xylanases 
and mannases) have the ability to associate with cellulose. These enzymes 
typically have a catalytic domain containing the active site for substrate 
hydrolysis and a carbohydrate-binding domain or cellulose-binding domain 
for binding cellulose. The CBD may also be from a non-catalytic 
polysaccharide binding protein. To date, more than one hundred cellulose- 
binding domains (CBDs) have been classified into at least thirteen families 
designated 1-XIII (Tomme et al (1995) "CelluloseBinding Domains: 
Classification and Properties", in ACS Symposium Series 618 Enzymatic 
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Degradation and Insoluble Carbohydrates, pp. 142-161, Saddler and Penner 
eds., American Chemical Society, Washington, D.C. (Tomme I) ; Tomme 
et al Adv. Microb. Physiol. (1995) 37:1 (Tomme II); and Smant et al, 
Proc. NatL Acad. Sci U.S.A. (1998) 95:4906,-491 1, all of which are 
incorporated herein by reference). Any of the CBDs described in Tomme I 
or 11 or any variants thereof, any other presently known CBDs or any new 
CBDs which may be identified can be used in the present invention. As an 
illustrative, but in no way limiting example, the CBP or CBD can be from a 
bacterial, fungal, slime mold, or nematode protein or polypeptide. For a 



10 more particular illustrative example, the CBD is obtainable from 
Clostridium cellulovorans, Clostridium cellulovorans, or Cellulomonas fimi 
(e.g., CenA, CenB, CenD, Cex). In addition, the CBD may be selected 
from a phage display peptide or peptidomimetic library, random or 
otherwise, using e.g., cellulose as a screening agent (See Smith Science 
15 (1985) 228:1315-1317 and Lam, Nature (1991) 354:82-84). Furthermore, 
the CBD may be derived by mutation of a portion of a protein or 
polypeptide which binds to a polysaccharide other than cellulose (or 
hemicellulose) but also binds cellulose, such as a chitinase, which 
specifically binds chitin, or a sugar binding protein such as maltose binding 
20 protein, rendering said portion capable of binding to cellulose. In any 
event, the CBD binds cellulose or hemicellulose. Shoseyov and Doi (Proc. 
Natl. Acad. Sci. USA (1990) 87:2192-2195) isolated a unique cellulose- 
binding protein (CbpA) from the cellulose "complex" of the cellulolytic 
bacterium Clostridium cellulovorans. This major subunit of the cellulose 
25 complex was found to bind to cellulose, but had no hydrolytic activity, and 
was essential for the degradation of crystalline cellulose. The CbpA gene 
has been cloned and sequenced (Shoseyov et al. Proc. Natl. Acad. Sci. 
USA (1992) 89:3483-3487). Using PCR primers flanking the cellulose- 
binding domain of CbpA, the latter was successfully cloned into an 
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overexpression vector that enabled overproduction of the approximately 17 
kDa CBD in Escherichia coli. The recombinant CBD exhibits very strong 
affinity to cellulose and chitin (U.S. Pat. No. 5,496,934; Goldstein et al y 
J. Bacteriol. (1993) 175:5762; PCT International Publication WO 
5 94/241 58, all are incorporated by reference as if fully set forth herein). 

In recent years, several CBDs have been isolated from different 
sources. Most of these have been isolated from proteins that have separate 
catalytic, i.e., cellulose and cellulose binding domains, and only two have 
been isolated from proteins that have no apparent hydrolytic activity but 
10 possess cellulose-binding activity (Goldstein et al. J- Bacteriol. (1993) 
175:5762-5768; Morag et al Appl. (1995) Environ. Microbiol. 61:1980- 
1986). 

Recombinant proteins: 

Any protein for which a gene is known or can be isolated can be used 
15 as the recombinant protein and be fused to the cellulose binding peptide 
according to the present invention. Advantageously, the recombinant 
protein is of a commercial value. A non-exhaustive list of recombinant 
proteins which can be manufactured utilizing the process of the present 
invention and their uses follows. 
20 Thus, for example, glucoamylases and glucose isomerases are used 

in the food processing industry to convert starch to high fructose corn syrup. 

Another useful class of enzymes are proteinases, which are used for 
the hydrolysis of high molecular weight proteins and which are further used 
in combination with detergents in cleaning applications, in leather 
25 manufacturing processes, in the food industry, and in the manufacture of 
alcoholic beverages. 

Enzymes known as pectinesterases, and several related enzymes, are 
used for pectin hydrolysis in the food industry. 
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A class of enzymes known as lipases axe used for the cleavage of 
ester linkages in triglycerides, and are used both in the food industry and for 
effluent treatment. 

The enzyme beta-galactosidase is used industrially for the hydrolysis 
of whey lactose. 

An enzyme known as thermolysin is used in the production of the 
artificial sweetener aspartame. 

An enzyme known as sulphydryl oxidase is used in the reduction of 
the cooked flavor of milk. 

Enzymes known as catalases are used to remove hydrogen peroxides 
from milk, cheese, and egg processing, and are further used in the 
sterilization and oxidation of plastics and rubbers. 

Heparinases are useful for the production of heparin and heparan 
sulfate oligosaccharides. 

Other proteins, in addition to enzymes, are those which have 
affinities to other compounds. For example, bacteria, fungi, plants and 
animals all contain a large number of proteins that exhibit specific 
interactions with agents such as metal ions and toxic compounds, and have 
high affinities for such agents. 

A class of proteins known as metalloproteins contain prosthetic 
groups that bind specifically to metal ions. An example of such a prosthetic 
group is the porphyrin group in hemoglobin. Some other examples of metal 
ion binding proteins include parvalbumin, which binds to calcium, and 
metallothionin, an animal protein that binds large amounts of metal ions, 
especially zinc. Such metal absorptive proteins could also be used for 
purification in industrial processes. 

It is also envisioned that streams of flowing material could be 
degraded by microbial enzymes. It is known that certain pollutants, 
whether natural or synthetic, and certain pesticides and other durable 
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organic compounds in the environment can be degraded (inactivated) or 
converted into useful compounds by microbial enzymes. 

It is known, for example, that some microorganisms, for example 
Pseudomonas putida^ possessed dehalogenases that are capable of 
degrading certain pesticides and herbicides, and rendering them less toxic. 
Similarly, hydrolysis of organophosphate insecticides have been observed 
by microbial enzymes. 

It is also possible to produce antibodies within plant cells. The 
antibodies can include monoclonal antibodies or fragments thereof having 
at least a portion of an antigen binding region, including immunoactive 
entities such as Fv, F(abl)2, Fab fragments (Harlow and Lane, 1988 
Antibody, Cold Spring Harbor), single chain antibodies (U.S. Pat. No. 
4,946,778), chimeric or humanized antibodies (Morrison et al. Proc. Natl. 
Acad. Sci. USA (1984) 81:6851; Neuberger et al Nature (1984) 312:604- 
608) and complementarily determining regions (CDR). 

Another class of proteins are those that bind to antibodies, such as 
protein- A, protein-G, protein-L and their mutants. 

It is also possible to produce protein antibiotics or peptides such as 
lysozyme or therapeutic proteins which might assist in healing processes, 
for example, certain wound healing peptides, growth factors and hormones. 
Proteins such as HSA can also be produced. 

Another class of proteins include proteins such as agglutinin, zein, 
silk, elastine proteins as well as COMP, JUN, FOS and other proteins that 
may form stable protein-protein interactions such as coiled-coil interactions 
that may be useful for production of protein fibers. 

Another example is the production of animal feed enzymes. Phytase 
from Aspergillus niger, for example, increases the availability of 
phosphorus from feed for monogastric animals by releasing phosphate from 
the substrate phytic acid, therefore reducing the need for costly phosphorus 
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supplements. A phytase cDNA was constitutively expressed in transgenic 
tobacco (Nicotiana tabacum) plants (Verwoerd et al. y Plant. Physiol. 
(1995) 109:1199-205). Soybean plants transformed with a fungal phytase 
gene improve phosphorus availability whereas excretion was decreased for 
5 broilers. It appears that phytase can improve growth performance of 
broilers fed low phosphorous diets when provided either as a commercial 
supplement or in the form of transformed seeds (Denbow et al^ Poult. Sci. 
(1989) 77:878.881), 

Other recombinant proteins of interest, will for the most part be 

10 mammalian proteins, and will include blood proteins, such as serum 
albumin, Factor VII, Factor VIIIc, Factor VIIlvW, Factor IX, Factor X, 
tissue plasminogen factor, Protein C, von Willebrand factor, antithrombin 
III, erythropoietin, colony stimulating factors, such as M-, GM-, 
cytokines, such as interleukins 1-11, integrals, addressins, selectins, homing 

15 receptors, surface membrane proteins, such as surface membrane protein 
receptors, T cell receptor units, immunoglobulins (as further detailed above 
with respect to antibodies), soluble major histocompatibility complex 
antigens, structural proteins, such as collagen, fibrin, elastin, tubulin, actin, 
and myosin, growth factor Teceptors, growth factors, growth hormone, cell 

20 cycle proteins, vaccines, fibrinogen, thrombin, cytokines and hyaluronidase. 
Additional examples include chymosin, polymerases, saccharidases, 
dehydrogenases, nucleases, oxido reductases such as fungal peroxidases and 
lactases, xylanases, rennin, horse radish peroxidase, amylases and soil 
remediation enzymes. 

25 The genes encoding all of the above listed proteins have been 

isolated and as such these proteins are readily available for recombinant 
expression and production according to the teachings of the present 
invention. It will be appreciated that new genes encoding an ever growing 
spectrum of proteins are continuously discovered and isolated, rendering 
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such genes available for molecular manipulation and recombinant 
expression. There is thus no intention to limit the recombinant protein 
produced utilizing the method of the present invention to any specific 
protein or list of proteins. 

Cellulose binding peptide-recombinant protein fusions: 

The fusion of two proteins for which genes has been isolated is well 
known and practiced in the art. Such fusion involves the joining together of 
heterologous nucleic acid sequences, in frame, such that translation thereof 
results in the generation of a fused protein product or a fusion proteins. 
Methods, such as the polymerase chain reaction (PCR), restriction, nuclease 
digestion, ligation, synthetic oligonucleotides synthesis and the like are 
typically employed in various combinations in the process of generating 
fusion gene constructs. One ordinarily skilled in the art can readily form 
such constructs for any pair or more of individual proteins. Interestingly, in 
most cases where such fusion or chimera proteins are produced, and in all 
cases where one of the proteins was a cellulose binding peptide, both the 
former and the latter retained their catalytic activity or function. 

For example, Greenwood et al. (1989, FEES Lett. 224:127-131) 
fused the cellulose binding region of Cellulomonas fimi endoglucanase to 
the enzyme alkaline phosphatase. The recombinant fusion protein retained 
both its phosphatase activity and the ability to bind to cellulose. For more 
descriptions of cellulose binding fusion proteins, see U.S. Patent No. 
5,137,819 issued to Kilburn et aL, and U.S. Patent No. 5,719,044 issued to 
Shoseyov et al. both incorporated by reference herein. See also U.S. Pat. 
No. 5,474,925. All of which are incorporated herein by reference. 

The recombinant protein immobilized via its fused counterpart to the 
cellulosic matter can be released from the plant derived cellulosic matter by 
cleavage thereof, e.g., by proteolysis, using either a nonspecific general 
protease such as proteinase K or trypsin, or a specific protease as further 
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detailed hereinunder. For example, release can be effected by treatment 
with proteinase K at a concentration of about 50 p.g/ml for about 20 minutes 
at about 37 °C (Din et al Bio/Technology (1991) 9:1096-1099). 
Inclusion of a dedicated cleavage site: 

According to a preferred embodiment of the present invention the 
fusion protein includes the recombinant protein and the cellulose binding 
peptide separated therebetween via a unique amino acid sequence 
recognizable and digestible by a protease or under predetermined cis or 
trans conditions for digesting a controllable intervening protein sequence.. 

As used herein in the specification and in the claims section that 
follows, the phrase "unique amino acid sequence recognizable and 
digestible by a protease" includes a protease recognition sequence which is 
both recognizable and readily accessible to a protease. Thus, the unique 
sequence can be a solitary sequence (i.e., which does not appear in the 
recombinant protein and optionally also not in the cellulose binding peptide) 
or alternatively, the sole sequence of several similar sequences which is not 
sequestered from the protease due to the tertiary structure of the 
recombinant protein and optionally the cellulose binding peptide. In both 
these cases proteolysis will release the recombinant protein from the fusion 
protein cellulosic matter complex. 

As used herein in the specification and in the claims section that 
follows, the phrase "controllable intervening protein sequence" includes 
unique amino acid sequences capable of excision from or cleavage of a 
peptide under predetermined conditions in cis or in trans, e.g., increase in 
temperature, exposure to light, unblocking of amino acid residues by 
dephosphorylation and treatment with chemical reagents or deglycosylation, 
examples include proteolysis effected under acidic conditions (HQ, e.g., to 
cleave between Asp and Pro) and proteolysis effected by a proteolysing 
reagent, such as CNBr to cleave downstream of Met, all as known in the art 
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and/or as further described in U.S. Pat. No. 5,834,247, which is 
incorporated herein by reference. 

Thus, according to an aspect of the present invention there is 
provided a composition of matter comprising (a) a plant derived cellulosic 
5 matter; and (b) a fusion protein including a recombinant protein and a 
cellulose binding peptide separated therebetween via a unique amino acid 
sequence recognizable and digestible by a protease or under predetermined 
cis or trans conditions for digesting a controllable intervening protein 
sequence, wherein the fusion protein is complexed to the plant derived 
; i 10 cellulosic matter by affinity binding via the cellulose binding peptide. 

Nucleic acid molecules which can be used according to preferred 
^ embodiments of the present invention to express the fusion protein in plant 

! La 

1)1 cells would therefore include a heterologous nucleic acid sequence 

including (i) a first sequence encoding a cellulose binding peptide; (ii) a 
Cn 15 second sequence encoding a recombinant protein, wherein the first and 
□ second sequences are joined together in frame in either orientation; and (iii) 

=.3 

q a third sequence encoding a unique amino acid sequence recognizable and 

digestible by a protease or under predetermined cis or trans conditions for 
digesting a controllable intervening protein sequence, the third sequence is 

20 between and in frame with the first and second sequences. 

Thus, specific cleavage can be used to release the recombinant 
protein from the fusion protein cellulosic matter complex. For example, 
one can include a protease recognition site or a chemical cleavage site 
between the recombinant protein and the cellulose binding peptide. 

25 Examples of recognition sites include those for collagenase, thrombin, 
enterokinase, and Factor X a which are cleaved specifically by the respective 
enzymes. Chemical cleavage sites sensitive, for example, to low pH or 
cyanogen bromide, can also be used. 
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Where cleavage is used, the recombinant protein can be cleaved 
readily from the cellulosic matter by the use of a protease specific for a 
sequence present therebetween and the cellulose binding peptide. 

It will be appreciated in this respect that four main classes of specific 
proteases are known, including (i) cysteine proteases, including cathepsin B 
and L; (ii) aspartyl protease cathepsin D; (iii) serine proteases including 
plasmin, tissue-type plasminogen activator (tPA) and urokinase-type 
plasminogen activator (uPA), and (iv) matrix metal] oproteinases (MMPs), 
including collagenases, gelatinases A and B (MMP2 and MMP9) and 
stromelysin (MMP3). Members of these protease families are commercially 
available and their recognition sequences known. As such, these proteases 
can be used to implement the step of releasing the recombinant protein from 
the plant derived cellulosic matter while implementing the process 
according to the present invention. 

Genetically modified plant material; 

According to an aspect of the present invention there is provided a 
nucleic acid molecule comprising (a) a promoter sequence for directing 
protein expression in plant cells; and (b) a heterologous nucleic acid 
sequence as further detailed herein, wherein, the heterologous nucleic acid 
sequence is down stream the promoter sequence, such that expression of the 
heterologous nucleic acid sequence is effectable by the promoter sequence. 
Such a nucleic acid molecule needs to be effectively introduced into plant 
cells, so as to genetically modify the plant. 

There are various methods of introducing foreign genes into both 
monocotyledonous and dicotyledenous plants (Potrykus, I., Annu. Rev. 
Plant. Physiol., Plant. Mol. Biol. (1991) 42:205-225; Shimamoto et al, 
Nature (1989) 338:274-276). The principle methods of causing stable 
integration of exogenous UNA into plant genomic DNA include two main 
approaches: 
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(i) Agrobacterium-mediatcd gene transfer: Klee et al (1987) 
Annu. Rev. Plant Physiol. 38:467-486; Klee and Rogers in Cell Culture 
and Somatic Cell Genetics of Plants, Vol. 6, Molecular Biology of Plant 
Nuclear Genes, eds. Schell, J., and Vasil, L. K. } Academic Publishers, San 

5 Diego, Calif. (1989) p. 2-25; Gatenby, in Plant Biotechnology, eds. 
Kung, S. and Arntzen, C. L, Butterworth Publishers, Boston, Mass. (1989) 
p. 93-112. 

(ii) direct DNA uptake: Paszkowski et al, in Cell Culture and 
Somatic Cell Genetics of Plants, Vol. 6, Molecular Biology of Plant 

10 Nuclear Genes eds. Schell, J., and Vasil, L. K., Academic Publishers, San 
Diego, Calif. (1989) p. 52-68; including methods for direct uptake of DNA 
into protoplasts, Toriyama, K. et al (1988) Bio/Technology 6:1072-1074. 
DNA uptake induced by brief electric shock of plant cells: Zhang et al 
Plant Cell Rep. (1988) 7:379-384. Fromm et al Nature (1986) 319:791- 

15 793. DNA injection into plant cells or tissues by particle bombardment, 
Klein et al Bip/Technology (1988) 6:559-563; McCabe et al 
Bio/Technology (1988) 6:923-926; Sanford, Physiol. Plant. (1990) 79:206- 
209; by the use of micropipette systems: Neuhaus et al, Theor. Appl. 
Genet. (1987) 75:30-36; Neuhaus and Spangenberg, Physiol. Plant. (1990) 

20 79:213-2 1 7; or by the direct incubation of DNA with germinating pollen, 
DeWet et al in Experimental Manipulation of Ovule Tissue, eds, 
Chapman, G. P. and Mantell, S. H. and Daniels, W. Longman, London, 
(1985) p. 197-209; and Ohta, Proc. Natl. Acad. Sci. USA (1986) 83:715- 
719. 

25 The Agrobacterium system includes the use of plasmid vectors that 

contain defined DNA segments that integrate into the plant genomic DNA. 
Methods of inoculation of the plant tissue vary depending upon the plant 
species and the Agrobacterium delivery system. A widely used approach is 
the leaf disc procedure which can be performed with any tissue explant that 
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provides a good source for initiation of whole plant differentiation. Horsch 
et al in Plant Molecular Biology Manual A5, Kluwer Academic Publishers, 
Dordrecht (1988) p. 1-9. The Agrobacterium system is especially viable in 
the creation of transgenic dicotyledenous plants. 
5 There are various methods of direct DNA transfer into plant cells. In 

electroporation, the protoplasts are briefly exposed to a strong electric field. 
In microinjection, the DNA is mechanically injected directly into the cells 
using very small micropipettes. In microparticle bombardment, the DNA is 
adsorbed on microprojectiles such as magnesium sulfate crystals or tungsten 
10 particles, and the microprojectiles are physically accelerated into cells or 
plant tissues. 

Following transformation plant propagation is exercised. The most 
common method of plant propagation is by seed. Regeneration by seed 
propagation, however, has the deficiency that due to heterozygosity there is 

15 a lack of uniformity in the crop, since seeds are produced by plants 
according to the genetic variances governed by Mendelian rules. Basically, 
each seed is genetically different and each will grow with its own specific 
traits. Therefore, it is preferred that the transgenic plant be produced such 
that the regenerated plant has the identical traits and characteristics of the 

20 parent transgenic plant, e.g., a reproduction of the fusion protein. 
Therefore, it is preferred that the transgenic plant be regenerated by 
micropropagation which provides a rapid, consistent reproduction of the 
transgenic plants. 

Micropropagation is a process of growing new generation plants 
25 from a single piece of tissue that has been excised from a selected parent 
plant or cultivar. This process permits the mass reproduction of plants 
having the preferred tissue expressing the fusion protein. The new 
generation plants which are produced are genetically identical to, and have 
all of the characteristics of, the original plant. Micropropagation allows 
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mass production of quality plant material in a short period of time and 
offers a rapid multiplication of selected cultivars in the preservation of the 
characteristics of the original transgenic or transformed plant. The 
advantages of cloning plants are the speed of plant multiplication and the 
5 quality and uniformity of plants produced. 

Micropropagation is a multi-stage procedure that requires alteration 
of culture medium or growth conditions between stages. Thus, the 
micropropagation process involves four basic stages: Stage one, initial 
tissue culturing; stage two, tissue culture multiplication; stage three, 

10 differentiation and plant formation; and stage four, greenhouse culturing 
and hardening. During stage one, initial tissue culturing, the tissue culture 
is established and certified contaminant- free. During stage two, the initial 
tissue culture is multiplied until a sufficient number of tissue samples are 
produced to meet production goals. During stage three, the tissue samples 

15 grown in stage two are divided and grown into individual plantlets. At 
stage four, the transgenic plantlets are transferred to a greenhouse for 
hardening where the plants' tolerance to light is gradually increased so that 
it can be grown in the natural environment. 

The basic bacterial/plant vector construct will preferably provide a 

20 broad host range prokaryote replication origin; a prokaryote selectable 
marker; and, for Agrobacterium transformations, T DNA sequences for 
Agrobacteriwn-medioXcd transfer to plant chromosomes. Where the 
heterologous sequence is not readily amenable to detection, the construct 
will preferably also have a selectable marker gene suitable for determining 

25 if a plant cell has been transformed. A general review of suitable markers 
for the members of the grass family is found in Wilmink and Dons, Plant 
MoL Biol. Reptr. (1993) 11:165-185. 

Sequences suitable for permitting integration of the heterologous 
sequence into the plant genome are also recommended. These might 
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include transposon sequences and the like for homologous recombination as 
well as Ti sequences which permit random insertion of a heterologous 
expression cassette into a plant genome. 

Suitable prokaryote selectable markers include resistance toward 
5 antibiotics such as ampicillin or tetracycline. Other DNA sequences 
encoding additional functions may also be present in the vector, as is known 
in the art. 

The constructs of the subject invention will include an expression 
cassette for expression of the fusion protein of interest. Usually, there will 

10 be only one expression cassette, although two or more are feasible. The 
recombinant expression cassette will contain in addition to the heterologous 
sequence one or more of the following sequence elements, a promoter 
region, plant 5' untranslated sequences, initiation codon depending upon 
whether or not the structural gene comes equipped with one, and a 

15 transcription and translation termination sequence. Unique restriction 
enzyme sites at the 5' and 3' ends of the cassette allow for easy insertion into 
a pre-existing vector. 

Viral infected plant material: 

Viruses are a unique class of infectious agents whose distinctive 
20 features are their simple organization and their mechanism of replication. 
In fact, a complete viral particle, or virion, may be regarded mainly as a 
block of genetic material (either DNA or RNA) capable of autonomous 
replication, surrounded by a protein coat and sometimes by an additional 
membranous envelope such as in the case of alpha viruses. The coat 
25 protects the virus from the environment and serves as a vehicle for 
transmission from one host cell to another. 

Viruses that have been shown to be useful for the transformation of 
plant hosts include CaV, TMV and BV. Transformation of plants using 
plant viruses is described in U.S. Pat, No. 4,855,237 (BGV), EP-A 67,553 
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(TMV), Japanese Published Application No. 63-14693 (TMV), EPA 
194,809 (BV), EPA 278,667 (BV); and Gluzman, Y. et aL, 
Communications in Molecular Biology: Viral Vectors, Cold Spring Harbor 
Laboratory, New York, pp. 172-189 (1988). Pseudovirus particles for use 
in expressing foreign DNA in many hosts, including plants, is described in 
WO 87/06261. 

Construction of plant RNA viruses for the introduction and 
expression of non- viral foreign genes in plants is demonstrated by the above 
references as well as by Dawson, W. O. et al, Virology (1989) 172:285- 
292; Takamatsu et al. EMBO J. (1987) 6:307-311; French et al Science 
(1986) 231:1294-1297; and Takamatsu et al FEBS Letters (1990) 269:73- 
76. 

When the virus is a DNA virus, the constructions can be made to the 
virus itself Alternatively, the virus can first be cloned into a bacterial 
plasmid for ease of constructing the desired viral vector with the foreign 
DNA, The virus can then be excised from the plasmid. If the virus is a 
DNA virus, a bacterial origin of replication can be attached to the viral 
DNA, which is then replicated by the bacteria. Transcription and 
translation of this DNA will produce the coat protein which will encapsidate 
the viral DNA. If the virus is an RNA virus, the virus is generally cloned as 
a cDNA and inserted into a plasmid. The plasmid is then used to make all 
of the constructions. The RNA virus is then produced by transcribing the 
viral sequence of the plasmid and translation of the viral genes to produce 
the coat protein(s) which encapsidate the viral RNA. 

Construction of plant RNA viruses for the introduction and 
expression of non-viral foreign genes in plants is demonstrated by the above 
references as well as in U.S. Pat. No. 5,316,931 

In one embodiment, a plant viral nucleic acid is provided in which 
the native coat protein coding sequence has been deleted from a viral 
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nucleic acid, a non-native plant viral coat protein coding sequence and a 
non-native promoter, preferably the subgenomic promoter of the non-native 
coat protein coding sequence, capable of expression in the plant host, 
packaging of the recombinant plant viral nucleic acid, and ensuring a 
systemic infection of the host by the recombinant plant viral nucleic acid, 
has been inserted. Alternatively, the coat protein gene may be inactivated 
by insertion of the non-native nucleic acid sequence within it, such that a 
fusion protein is produced. The recombinant plant viral nucleic acid may 
contain one or more additional n on -native subgenomic promoters. Each 
non-native subgenomic promoter is capable of transcribing or expressing 
adjacent genes or nucleic acid sequences in the plant host and incapable of 
recombination with each other and with native subgenomic promoters. 
Non-native (foreign) nucleic acid sequences may be inserted adjacent the 
native plant viral subgenomic promoter or the native and a non-native plant 
viral subgenomic promoters if more than one nucleic acid sequence is 
included. The non-native nucleic acid sequences are transcribed or 
expressed in the host plant under control of the subgenomic promoter to 
produce the desired products. 

In a second embodiment, a recombinant plant viral nucleic acid is 
provided as in the first embodiment except that the native coat protein 
coding sequence is placed adjacent one of the non-native coat protein 
subgenomic promoters instead of a non-native coat protein coding 
sequence. 

In a third embodiment, a recombinant plant viral nucleic acid is 
provided in which the native coat protein gene is adjacent its subgenomic 
promoter and one or more non-native subgenomic promoters have been 
inserted into the viral nucleic acid. The inserted non-native subgenomic 
promoters are capable of transcribing or expressing adjacent genes in a 
plant host and are incapable of recombination with each other and with 
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native subgenomic promoters. Non-native nucleic acid sequences may be 
inserted adjacent the non -native subgenomic plant viral promoters such that 
said sequences are transcribed or expressed in the host plant under control 
of the subgenomic promoters to produce the desired product. 
5 In a fourth embodiment, a recombinant plant viral nucleic acid is 

provided as in the third embodiment except that the native coat protein 
coding sequence is replaced by a non-native coat protein coding sequence. 

The viral vectors are encapsidated by the coat proteins encoded by 
the recombinant plant viral nucleic acid to produce a recombinant plant 

10 virus. The recombinant plant viral nucleic acid or recombinant plant virus 
is used to infect appropriate host plants. The recombinant plant viral 
nucleic acid is capable of replication in the host, systemic spread in the host, 
and transcription or expression of foreign gene(s) in the host to produce the 
desired fusion protein. 

15 Fusion protein compartmentalization - signal peptides; 

As already mentioned hereinabove, compartmentalization of the 
fusion protein is an important feature of the present invention because it 
allows undisturbed plant growth. Thus, according to one aspect of the 
present invention, the fusion protein is compartmentalized within cells of 

20 the plant or cultured plant cells, so as to be sequestered from cell walls of 
the cells of the plant or cultured plant cells. 

The fusion protein can be compartmentalized within a cellular 
compartment, such as, for example, the cytoplasm, endoplasmic reticulum, 
golgi apparatus, oil bodies, starch bodies, chloroplastids, chloroplasts, 

25 chromoplastids, chromoplasts, vacuole, lysosomes, mitochondria or the 
nucleus. 

Accordingly, the heterologous sequence used while implementing 
the process according to this aspect of the present invention includes (i) a 
first sequence encoding a cellulose binding peptide; (ii) a second sequence 
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encoding a recombinant protein, wherein the first and second sequences are 
joined together in frame; and (iii) a third sequence encoding a signal peptide 
for directing a protein to a cellular compartment, the third sequence being 
upstream and in frame with the first and second sequences. 
5 The following provides description of signal peptides which can be 

used to direct the fusion protein according to the present invention to 
specific cell compartments. 

It is well-known that signal peptides serve the function of 
translocation of produced protein across the endoplasmic reticulum 

10 membrane. Similarly, transmembrane segments halt translocation and 
provide anchoring of the protein to the plasma membrane, see, Johnson et 
al The Plant Cell (1990) 2:525-532; Sauer et al EMBO J. (1990) 9:3045- 
3050; Mueckler et al Science (1985) 229:941-945. Mitochondrial, nuclear, 
chloroplast, or vacuolar signals target expressed protein correctly into the 

15 corresponding organelle through the secretory pathway, see, Von Heijne, 
Eur. J. Biochem. (1983) 133:17-21; Yon Heijne, J. Mol. Biol. (1986) 
189:239-242; lturriaga et al The Plant Cell (1989) 1:381-390; McKnight et 
al, Nucl. Acid Res. (1990) 18:4939-4943; Matsuoka and Nakamura, Proc. 
Natl. Acad. Sci. USA (1991) 88:834-838. A recent book by Cunningham 

20 and Porter (Recombinant proteins from plants, Eds. C. Cunningham and 
A.J.R. Porter, 1998 Humana Press Totowa, N.J.) describe methods for the 
production of recombinant proteins in plants and methods for targeting the 
proteins to different compartments in the plant cell. In particular, two 
chapters therein (14 and 15) describe different methods to introduce 

25 targeting sequences that results in accumulation of recombinant proteins in 
compartments such as ER, vacuole, plastid, nucleus and cytoplasm. The 
book by Cunningham and Porter is incorporated herein by reference. 
Presently, the preferred site of accumulation of the fusion protein according 
to the present invention is the ER using signal peptide such as Cel 1 or the 
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rice amylase signal peptide at the N-terminus and an ER retaining peptide 
(HDEL, SEQ ID N0:1; or KDEL, SEQ ID NO:2) at the C-terminus. 
Promoters and control of expression: 

Any promoter which can direct the expression of the fusion protein 
5 according to the present invention can be utilized to implement the process 
of the instant invention, both constitutive and tissue specific promoters. 
According to presently preferred embodiment the promoter selected is 
constitutive, because such a promoter can direct the expression of higher 
levels of the fusion protein. In this respect the present invention offers a 

10 major advantage over the teachings of U.S. Pat. No. 5,474,925 in which 
only tissue specific and weak promoters can be employed because of the 
detelerious effect of the fusion protein described therein on cell wall 
development. The reason for which the present invention can utilize strong 
and constitutive promoters relies in the compartmentalization and 

15 sequestering approach which prohibits contact between the expressed fusion 
protein and the plant cell walls which such walls are developing. 

Constitutive and tissue specific promoters, CaMV35S promoter 
(Odeli et al Nature (1985) 313:810-812) and ubiquitin promoter 
(Christensen and Quail, Transgenic research (1996) 5:213-218) are the most 

20 commonly used constitutive promoters in plant transformations and are the 
preferred promoters of choice while implementing the present invention. 

In corn, within the kernel, proteins under the ubiquitin promoters, are 
preferentially accumulated in the germ (Kusnadi et al, Biotechnol. Bioeng. 
(1998) 60:44-52). The amylose-extender (Ae) gene encoding starch- 

25 branching enzyme lib (SBEIIb) in maize is predominantly expressed in 
endosperm and embryos during kernel development (Kim et al Plant. Mol. 
Biol. (1998) 38:945-956). A starch branching enzyme (SBE) showed 
promoter activity after it was introduced into maize endosperm suspension 
cells by particle bombardment (Kim et al Gene (1998) 216:233-243). In 
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transgenic wheat it has been shown that a native HMW-GS gene promoter 
can be used to obtain high levels of expression of seed storage and, 
potentially, other proteins in the endosperm (Blechl and Anderson, Nat. 
Biotechnol. (1996) 14:875-9). Polygalacturonase (PG) promoter was shown 
to confer high levels of ripening-specific gene expression in tomato 
(Nichoiasse/a/. Plant Mol. Biol. (1995) 28:423-435). The ACC oxidase 
promoter (Blume and Grierson, Plant. J. (1997) 12:731-746) represents a 
promoter from the ethylene pathway and shows increased expression during 
fruit ripening and senescence in tomato. The promoter for tomato 3- 
hydroxy-3-methyIglutaryl coenzyme A reductase gene accumulates to high 
level during fruit ripening (Daraselia et al Plant. Physiol. (1996) 1 12:727- 
733). Specific protein expression in potato tubers can be mediated by the 
patath promoter (Sweetlove et al Biochem. J. (1996) 320:487-492), 
Protein linked to a chloroplast transit peptide changed the protein content in 
transgenic soybean and canola seeds when expressed from a seed-specific 
promoter (Falco et al. Biotechnology (NY) (1995) 13:577-82). The seed 
specific bean phaseolin and soybean beta-conglycinin promoters are also 
suitable for the latter example (Keeler et al Plant. Mol. Biol. (1997) 
34:15-29). Promoters that are expressed in plastids are also suitable in 
conjunction with plastid transformation. 

Each of these promoters can be used to implement the process 
according to the present invention. 

Thus, the plant promoter employed can a constitutive promoter, a 
tissue specific promoter, an inducible promoter or a chimeric promoter. 

Examples of constitutive plant promoters include, without being 
limited to, CaMV35S and CaMV19S promoters, FMV34S promoter, 
sugarcane bacilliform badnavirus promoter, CsVMV promoter, Arabidopsis 
ACT2/ACT8 actin promoter, Arabidopsis ubiquitin UBQ1 promoter; barley 
leaf thionin BTH6 promoter, and rice actin promoter. 
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Examples of tissue specific promoters include, without being limited 
to, bean phaseolin storage protein promoter, DLEC promoter, PHSp 
promoter, zein storage protein promoter, conghitin gamma promoter from 
soybean, AT2S1 gene promoter, ACT11 actin promoter from Arabidopsis, 
napA promoter from Brassica napus and potato patatin gene promoter. 

The inducible promoter is a promoter induced by a specific stimuli 
such as stress conditions comprising, for example, light, temperature, 
chemicals, drought, high salinity, osmotic shock, oxidant conditions or in 
case of pathogenicity and include, without being limited to, the light- 
in ducible promoter derived from the pea rbcS gene, the promoter from the 
alfalfa rbcS gene, the promoters DRE, MYC and MYB active in drought; 
the promoters INT, INPS, prxEa, Ha hspl7.7G4 and RD21 active in high 
salinity and osmotic stress, and the promoters hsr303J and str246C active in 
pathogenic stress. 

Expression follow up: 

Expression of the fusion protein can be monitored by a variety of 
methods. For example, ELISA or western blot analysis using antibodies 
specifically recognizing the recombinant protein or its cellulose binding 
peptide counterpart can be employed to qualitatively and/or quantitatively 
monitor the expression of the fusion protein in the plant. Alternatively, the 
fusion protein can be monitored by SDS-PAGE analysis using different 
staining techniques, such as, but not limited to, coomasie blue or silver 
staining. Other methods can be used to monitor the expression level of the 
RNA encoding for the fusion protein. Such methods include RNA 
hybridization methods, e,g., Northern blots and RNA dot blots. 

Binding of the fusion protein to the plant derived cellulosic matter; 

When sufficient expression has been detected, binding of the fusion 
protein to the plant derived cellulosic matter is effected. Such binding can 
be achieved, for example, as follows. Whole plants, plant derived tissue or 
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cultured plant cells are homogenized by mechanical method in the presence 
or absence of a buffer, such as, but not limited to, PBS. The fusion protein 
is therefore given the opportunity to bind to the plant derived cellulosic 
matter. Buffers that may include salts and/or detergents at optimal 
5 concentrations may be used to wash non specific proteins from the 
cellulosic matter. 

Extraction and purification: 

In general, a recent book by Cunningham and Porter (Recombinant 
proteins from plants, Eds. C. Cunningham and AJ.R. Porter, 1998 

10 Humana Press Totowa, N.J,) describes methods for the production of 
recombinant proteins in plants including methods for extraction of the 
proteins from the plants. The methods used herein for extraction of proteins 
from plants are similar, however the ability of the fusion protein to bind to 
cellulose dictates its fate, unless extraction is done under condition in which 

15 the cellulose binding peptide do not bind to cellulose, for example, pH 
higher than 1 0 (for most CBDs) or high concentration of glucose , or 
cellobiose (200 mM or higher) for family IX CBDs. If the initial extraction 
is conducted under conditions that prevent binding, the supernatant is 
cleared from the cellulosic matter and then the solution is brought by either 

20 dilution, dialysis or pH correction, if necessary, to a condition that enables 
binding, after which cellulose is added in a batch or the solution is loaded 
on a cellulose column. Cellulose affinity purification is conducted as 
described, for example, in U.S. Pat. Nos. 5,719,044; 5,670,623; 5,856,021; 
5,137,819; 5,202,247; 5,340,731; and 5,474,925; and U.S. Pat. applications 

25 Nos. 08/788,621; and 08/788,622; EP 0 381 719 Bl, and EP application No. 
93907724.4. Alternatively, the extraction solution provides conditions that 
favor binding to the plant derived cellulosic matter. 

In any case, while the fusion protein is bound to cellulose, further 
whases can be employed for further removal of unbound proteins, 
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conditions which dissociate such binding or proteolytic cleavage can be 
used to isolate the fusion protein itself, or proteolytic cleavage can be used 
to isolate the recombinant protein, all as further detailed hereinabove. 

5 Additional objects, advantages, and novel features of the present 

invention will become apparent to one ordinarily skilled in the art upon 
examination of the following examples, which is not intended to be 
limiting. Additionally, each of the various embodiments and aspects of the 
present invention as delineated hereinabove and as claimed in the claims 

10 section below finds experimental support in the following examples. 

EXAMPLES 



"~ Reference is now made to the following examples, which together 

15 with the above descriptions, illustrate the invention in a non limiting 

j!* fashion. Generally, the nomenclature used herein and the laboratory 

^ procedures in recombinant DNA technology described below are those well 

":=» 

known and commonly employed in the art. Standard techniques are used 
for cloning, DNA and RNA isolation, amplification and purification. 

20 Generally enzymatic reactions involving DNA ligase, DNA polymerase, 
restriction endonucleases and the like are performed according to the 
manufacturers' specifications. These techniques and various other 
techniques are generally performed according to Sambrook et al, molecular 
Cloning - A Laboratory Manual, Cold Spring Harbor Laboratory, Cold 

25 Spring Harbor, N.Y. (1989). Other general references are provided 
throughout this document. The procedures therein are believed to be well 
known in the art and are provided for the convenience of the reader. All the 
information contained therein, as well as that contained in the Manual by 
Sambrok et ai, is incorporated herein by reference. 
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EXAMPLE 1 

Expression of CBD-Protein-L in transgenic potato tubers 
Constructs: 

5 A vector containing the class-I patatin gene B33 promoter (Olesinski 

et al, 1996, Plant Phisiol. 111:541-550), fused to the cell signal sequence 
(Shani et al, 1997, Plant Mol. Biol. 34:837-842), protein-L (hereinafter 
ProtL, Nilson et al, 1993, J. Immunol. Meth. 164:33-40), CBD cex 
sequence (Gilkes et al, 1991, Microbiol. Rev. 55:303-315) and HDEL 
£3 io (SEQ ID NO:l) retaining peptide sequence was constructed as follows. 

A DNA fragment encoding the cell signal peptide (nucleotide 1-105 
Q of SEQ ID NO:3) was generated by PCR using the following primers: No. 1 

13 (BgHI) 5'-AAAAAGATCTATGGCGCGAAAATCACTAA-3' (SEQ ID 

r NO:4) and No. 2 (Xbal) 5'-AAAATCTAGATTTACGGAGAGCGTCGCG- 

m 15 3' (SEQ ID NO: 5). A DNA fragment encoding the ProtL-CBDcex 
K (nucleotides 3-1280 of SEQ ID NO:6) was generated by PCR using the 

following primers: No. 3 (Xbal) 5-AAAATCTAGAATGGCG 
GCGGTAGAAAATAAAG-3 t (SEQ ID NO:7); and No. 4 (HDEL, Stop 
and Sail) 5'-AAAAGTCGACTTAAAGTTCATCATGCTCGACGCCGA 
20 CCGTGCA-3' (SEQ ID NO:8). The two fragment were digested with 
Bglll, Xbal and Sail and ligated in one step into pUC19 (New England 
Biolabs, Beverly, Massachusetts) that was pre digested with BamHI and 
Sail. The primer for the c-terminal end of the ProtL-CBD contained the 
retaining peptide sequence HDEL (SEQ ID NO:l) and a stop codon in 
25 frame. The DNA containing the Cell Signal-ProtL-CBD-HDEL 
(hereinafter, ProtL-CBD) fusion was excised using Smal and Sail and was 
subcloned into the Smal and Sail sites of the binary vector Binl9 (Bevan, 
1984, Nuc. Acid Res. 12:8711-8721) under the class-I patatin gene B33 
promoter (Olesinski et al, 1996, Plant Phisiol. 1 1 1:541-550). 
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Construction of transgenic plants: 

The binary vector was mobilized into disarmed LB 4404 
Agrobacterium tumefaciens (An, 1987, Meth. Enzymol. 153:292-305). 
Tuber discs transformation was performed with Solanum tuberosum cv 
5 Desiree plants as described previously (Olesinski et al, 1996, 1996, Plant 
Phisiol. 111:541-550). Regenerated transgenic plants were selected on 
kanamycin. 

Analysis of transgenic plants: 

Transformant plants were selected by rooting on kanamycin (100 
10 mg/L), Growth conditions in growth room and green house were as 
described previously (Olesinski et al, 1996, 1996, Plant PhisioL 111:541- 
550). 

Transgenic plants transformed with the ProtL-CBD construct can be 
tested for expression of ProtL-CBD. Thus, for example, a Northern blot 

15 analysis of ProtL-CBD RNA expression can be carried out using the DNA 
fragment of SEQ ID NO:3 or a portion thereof as a probe. Furthermore, 
RT-PCR can be performed on RNA samples obtained from such transgenic 
plants using PCR primers No. 3 and No. 4 to amplify and detect the ProtL- 
CBD RNA. Additional methods for analysis of expression in transgenic 

20 plants include RNase protection assays. Expression of the ProtL-CBD 
fusion protein can alternatively or additionally be evaluated by monitoring 
the fusion protein itself using, for example, ELISA or Western blot 
protocols and anti-CBD or ProtL antibodies. Yet in addition, the ProtL- 
CBD fusion protein can be visualized by SDS-PAGE analysis using 

25 different staining techniques, such as coomasie blue or silver staining, 
immuno precipitation, enzyme linked immunoassays or CBD and ProtL 
binding assays. In addition, techniques such as hi situ hybridization and 
immunostaning also may be used to detect the presence or expression of the 
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recombinant construct in specific plant organs and tissues. The methods for 
performing all of these assays are well known to those skilled in the art. 

Although the invention has been described in conjunction with 
specific embodiments thereof, it is evident that many alternatives, 
modifications and variations will be apparent to those skilled in the art. 
Accordingly, it is intended to embrace all such alternatives, modifications 
and variations that fall within the spirit and broad scope of the appended 
claims. All publications cited herein are incorporated by reference in their 
entirety. 
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SEQUENCE LISTING 



ID 



GENERAL INFORMATION ; 



(i) 
(ii> 



(iii> 
(iv) 



(v) 



<vi> 



Wii> 



<viii) 



fix) 



APPLICANT: 

TITLE OF INVENTION: 



NUMBER OF SEQUENCES: 
CORRESPONDENCE ADDRESS; 



(A) 
(B) 
fC) 
CD) 
(E) 
(F) 



ADDRESSEE: 
STREET t 
CITY: 
STATE : 
COUNTRY: 
ZIP; 



COMPUTER READABLE FORM: 



{A) 
(B) 
(C> 

(D) 



MEDIUM TYPE: 
COMPUTER: 
OPERATING SYSTEM: 

SOFTWARE : 



2iv Shani et al . 

PROCESS OF EXPRESSING AND ISOLATING 
RECOMBINANT PROTEINS AND RECOMBINANT 
PROTEIN PRODUCTS FROM PLANTS, PLANT 
DERIVED TISSUES OR CULTURED PLANT CELLS 
43 

Mark M. Friedman c/o Anthony Castorina 
2001 Jefferson Davis Highway, Suite 20 7 
Arlington 
Virginia 

United States of America 
22202 

1.44 megabyte, 3,5" microdisk 
Twinhead® Slimnote-890TX 
MS DOS version 6.2, 
windows version 3.11 

Word for Windows version 2.0 converted to 
an ASCI file 



CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 

(B> FILING DATE: 

(C) CLASSIFICATION ; 

PRIOR APPLICATION DATA: 
(A) APPLICATION NUMBER: 

<B) FILING DATE; 

ATTORNEY/AGENT INFORMATION: 

(A) NAME : 

(B) REGISTRATION NUMBER: 

(C) REFERENCE /DOCKET NUMBER: 
TELECOMMUNICATI ON INFORMATION : 
tA> TELEPHONE: 

(B) TELEFAX: 

(C) TELEX: 



Friedmam, Mark M. 

33, 883 

1079/5 

972-3-5625553 
972-3-5625554 



INFORMATION FOR SEQ ID NO-.l 



SEQUENCE CHARACTERISTICS; 



(A) 
(B) 
(C) 
(D) 



LENGTH: 
TYPE: 

STRANDEDNESS: 
TOPOLOGY : 



[xi) 

His Asp Glu Leu 
4 



SEQUENCE DESCRIPTION: 



amino acid 

single 

linear 

SEQ ID NO;l 



(2) 



INFORMATION FOR SEQ ID N0:2: 
(i} SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 4 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
SEQUENCE DESCRIPTION; SEQ ID NO: 2- 



Lys Asp Glu Leu 
4 
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(2) INFORMATION FOR SEQ ID NO: 3: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
ATGGCGCGAA AATCCCTAAT TTTCCCGGTG ATTTTGCTCG CCGTTCTTCT 50 
CTTCTCTCCG CCGATTTACT CCGCCGGTCA CGATTACCGC GACGCTCTCC 100 
GTAAAAGCAT TCTCTTCTTC GAAGGTCAAC GTTCCGGTAA ACTCCCTCCA 150 

(2) INFORMATION FOR SEQ ID N0:4: 

(i} SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 29 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 3D NO:4: 
AAAAAGATCT ATGGCGCGAA AATCACTAA 2 9 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

IA) LENGTH: 2 9 

(B) TYPE: nucleic acid 

(C) STRANDEdNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : S : 
AAAATCTAGA TTTACGGAGA GCGTCGCG 29 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 





(A) 


LENGTH : 


1305 








(B> 


TYPE: 


nucleic acid 






(O 


STRANDEDNESS : double 








{D) 


TOPOLOGY: 


1 inear 






(xi) SEQUENCE DESCRIPTION: SEQ ID 


N0:6: 




CCATGGCGGC 


GGTAGAAAAT 


AAAGAAGAAA 


CACCAGAAAC 


ACCAGAAACT 


50 


GATTCAGAAG 


AAGAAGTAAC 


AATCAAAGCT 


AACCTAATCT 


TTGCAAATGG 


100 


AAGCACACAA 


ACTGCAGAAT 


TCAAAGGAAC 


ATTTGAAAAA 


GCAACATCAG 


150 


AAGCTTATGA 


GTATGCAGAT 


ACTTTGAAGA 


AAGACAATGG 


AGAATATACT 


200 


GTAGATGTTG 


CAGATAAAGG 


TTATACTTTA 


AATATTAAAT 


TTGCTGGAAA 


250 


AGAAAAAACA 


CCAGAAGAAC 


CAAAAGAAGA 


AGTTACTATT 


aaagcaaact 


300 


TAATCTATGC 


AGATGGAAAA 


AC AC AAA C AG 


CAGAATTCAA 


AGGAACATTT 


350 


GAAGAAGCAA 


CAGCAGAAGC 


ATACAGATAT 


GCAGATG CAT 


TAAAGAAGGA 


400 


CAATGGAGAA 


TATACAGTAG 


ACGTTGCAGA 


TAAAGGTTAT 


ACTTTAAATA 


450 


TTAAATTTGC 


TGGAAAAGAA 


AAAACACCAG 


AAGAACCAAA 


AGAAGAAGTT 


500 


ACTATTAAAG 


CAAACTTAAT 


CTATG CAGAT 


GGAAAAACAC 


AAACAGCAGA 


550 


ATTCAAAGGA 


ACATTTGAAG 


AAGCAACAGC 


AGAAGCATAC 


AGATATGCTG 


600 


ACTTATTAGC 


AGCAAAAGAA 


AATGGTAAAT 


ATACAGTAGA 


CGTTGCAGAT 


650 


AAAGGTTATA 


CTTTAAATAT 


TAAATTTGCT 


GGAAAAGAAA 


AAACACCAGA 


700 


AGAACCAAAA 


GAAGAAGTTA 


CTATTAAAGC 


AAACTTAATC 


TATGCAGATG 


750 


GAAAAACTCA 


AACAGCAGAG 


TTCAAAGGAA 


CATTTGCAGA 


AGCAACAGCA 


800 


GAAGCATACA 


GATACGCTGA 


CTTATTAGCA 


AAAGAAAATG 


GTAAATATAC 


350 


AGCAGACTTA 


GAAGATGGTG 


GATACACTAT 


TAATATTAGA 


TTTGCAGGTA 


900 


AGAAAGTTGA 


CGAAAAACCA 


GAAGGGATCC 


CTCCGACGCC 


GACCCCGACT 


950 


AGTGGTCCGG 


CCGGGTGCCA 


GGTGCTGTGG 


GGCGTCAACC 


AGTGGAACAC 


1000 


CGGCTTCACC 


GCGAACGTCA 


CCGTGAAGAA 


CACGTCCTCC 


GCTCCGGTAG 


10S0 


ACGGCTGGAC 


GCTCACGTTC 


AGCTTCCCGT 


CCGGCCAGCA 


GGTCACCCAG 


1100 
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GCGTGGAGCT CGACGGTCAC GCAGTCCGGC TCGGCCGTGA CGGTCCGCAA 11S0 
CGCCCCGTGG AACGGCTCGA TCCCGGCGGG CGGCACCGCG CAGTTCGGCT 1200 
TCAACGGCTC GCACACGGGC ACCAACGCCG CGCCGACGGC GTTCTCGCTC 1250 
AACGGCACGC CCTGCACGGT CGGCGTCGAG CACC&CCACC ACCACCACCA 1300 
CCACI 1305 



(2) 



INFORMATION FOR SEQ ID NO: 7: 



(Xi) 



SEQUENCE CHARACTERISTICS: 



(A) 
(S) 
(C) 
(D) 



LENGTH: 
TYPE: 

STRAND EDNES S : 
TOPOLOGY : 



SEQUENCE DESCRIPTION: 



32 

nucleic acid 

single 

1 inear 

SEQ ID N0:7: 



AAAATCTAGA ATGGCGGCGG TAGAAAATAA AG 32 



(2) 



INFORMATION FOR SEQ IJD NO: 



ti) 



(xi) 



SEQUENCE CHARACTERISTICS: 



(A} 
<B? 
(C) 
CD) 



LENGTH: 
TYPE: 

STRANDEDNESS : 
TOPOLOGY ; 



SEQUENCE DESCRIPTION: 



43 

nucleic acid 

single 

linear 

SEQ ID NO: 8: 



AAAAGTCGAC TTAAAGTTCA TCATGCTCGA CGCCGACCGT GCA 43 
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WHAT IS CLAIMED IS: 



1 . A process of expressing a recombinant protein in a plant and 
of isolating the recombinant protein from the plant, the process comprising 
the steps of: 

(a) providing a plant, a plant derived tissue or cultured plant cells 
expressing a fusion protein including the recombinant protein 
and a cellulose binding peptide being fused thereto, said 
fusion protein being compartmentalized within cells of said 
plant, plant derived tissue or cultured plant cells, so as to be 
sequestered from cell walls of said cells of said plant, plant 
derived tissue or cultured plant cells; 

(b) homogenizing said plant, plant derived tissue or cultured plant 
cells, so as to bring into contact said fusion protein with a 
plant derived cellulosic matter of said plant, plant derived 
tissue or cultured plant cells, to thereby effect affinity binding 
of said fusion protein via said cellulose binding peptide to said 
cellulosic matter, thereby obtaining a fusion protein cellulosic 
matter complex; and 

(c) isolating said fusion protein cellulosic matter complex, 

2. The process of claim 1 , further comprising the step of: 

(d) washing said fusion protein cellulosic matter complex, thereby 
removing endogenous plant proteins and other plant material 
therefrom. 

3 . The process of claim 2, further comprising the step of: 
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(e) collecting said fusion protein cellulosic matter complex as a 
final product of the process, 

4. The process of claim 2, further comprising the step of: 

(e) exposing said fusion protein cellulosic matter complex to 
conditions effective in dissociating said fusion protein from 
said cellulosic matter; and 

(f) isolating said fusion protein, thereby obtaining an isolated 
fusion protein. 

5. The process of claim 4, wherein said conditions effective in 
dissociating said fusion protein from said cellulosic matter are selected from 
the group consisting of basic conditions, denaturative conditions and 
affinity displacement conditions. 

6. The process of claim 4, further comprising the step of: 

(g) exposing said isolated fusion protein to conditions effective in 
digesting said fusion protein so as to release said recombinant 
protein therefrom, thereby obtaining a released recombinant 
protein. 

7. The process of claim 6, wherein said conditions effective in 
digesting said fusion protein so as to release said recombinant protein 
therefrom are selected from the group consisting of proteolysis effected via 
a protease and proteolysis effected under predetermined cis or trans 
conditions for digesting a controllable intervening protein sequence. 

8. The process of claim 4, further comprising the step of: 

(h) isolating said released recombinant protein. 
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9. The process of claim 2, further comprising the step of: 

(e) exposing said fusion protein cellulosic matter complex to 
conditions effective in digesting said fusion protein so as to 
release said recombinant protein therefrom, thereby obtaining 
a released recombinant protein. 

10. The process of claim 6, wherein said conditions effective in 
digesting said fusion protein so as to release said recombinant protein 
therefrom are selected from the group consisting of proteolysis effected via 
a protease and proteolysis effected under predetermined cis or trans 
conditions for digesting a controllable intervening protein sequence. 

1 1 . The process of claim 4, further comprising the step of: 

(f) isolating said released recombinant protein. 

12. A genetically modified or viral infected plant or cultured plant 
cells expressing a fusion protein including a recombinant protein and a 
cellulose binding peptide, said fusion protein being compartmentalized 
within cells of said plant or cultured plant cells, so as to be sequestered from 
cell walls of said cells of said plant or cultured plant cells. 

13. The genetically modified or viral infected plant or cultured 
plant cells of claim 12, wherein expression of said fusion protein is under a 
control of a constitutive or tissue specific plant promoter. 

14. The genetically modified or viral infected plant or cultured 
plant cells of claim 12, wherein said fusion protein is compartmentalized 
within a cellular compartment selected from the group consisting of 
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cytoplasm, endoplasmic reticulum, golgi apparatus, oil bodies, starch 
bodies, chloroplastids, chloroplasts, chromoplastids, chromoplasts, vacuole, 
lysosomes, mitochondria, and nucleus. 

15. A genetically modified or viral infected plant or cultured plant 
cells expressing a fusion protein including a recombinant protein and a 
cellulose binding peptide separated therebetween via a unique amino acid 
sequence recognizable and digestible by a protease or under predetermined 
cis or trans conditions for digesting a controllable intervening protein 
sequence. 

16. The genetically modified or viral infected plant or cultured 
plant cells of claim 15, wherein said fusion protein is compartmentalized 
within cells of said plant or cultured plant cells, so as to be sequestered from 
cell walls of said cells of said plant or cultured plant cells. 

17. The genetically modified or viral infected plant or cultured 
plant cells of claim 15, wherein said fusion protein is compartmentalized 
within a cellular compartment selected from the group consisting of 
cytoplasm, endoplasmatic rericulum, golgi apparatus, oil bodies, starch 
bodies, chloroplastids, chloroplasts, chromoplastids, chromoplasts, vacuole, 
lysosomes, mitochondria, and nucleus. 

18. The genetically modified or viral infected plant or cultured 
plant cells of claim 15, wherein expression of said fusion protein is under a 
control of a constitutive or tissue specific plant promoter. 

19. A composition of matter comprising: 

(a) a plant derived cellulosic matter of a plant; and 
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(b) a fusion protein including a recombinant protein and a 
cellulose binding peptide separated therebetween via a unique 
amino acid sequence recognizable and digestible by a protease 
or under predetermined cis or trans conditions for digesting a 
controllable intervening protein sequence, said fusion protein 
being expressed in said plant and complexed to said plant 
derived cellulosic matter of said plant by affinity binding via 
said cellulose binding peptide. 

20. A nucleic acid molecule comprising: 

(a) a promoter sequence for directing protein expression in plant 
cells; 

(b) a heterologous nucleic acid sequence including: 

(i) a first sequence encoding a cellulose binding peptide; 

(ii) a second sequence encoding a recombinant protein, 
wherein said first and second sequences are joined 
together in frame; and 

(iii) a third sequence encoding a unique amino acid 
sequence being recognizable and digestible by a 
protease or under predetermined cis or trans conditions 
for digesting a controllable intervening protein 
sequence, said third sequence being between and in 
frame with said first and second sequences; 

wherein, said heterologous nucleic acid sequence being down stream said 
promoter sequence, such that expression of said heterologous nucleic acid 
sequence is effectable by said promoter sequence. 

21. The nucleic acid molecule of claim 20, further comprising a 
sequence element selected from the group consisting of an origin of 
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replication for propagation in bacterial cells, at least one sequence element 
for integration into a plant's genome, a polyadenylation recognition 
sequence, a transcription termination signal, a sequence encoding a 
translation start site, a sequence encoding a translation stop site, plant RNA 
virus derived sequences, plant DNA virus derived sequences, tumor 
inducing (Ti) plasmid derived sequences, and a transposable element 
derived sequence. 

22. A nucleic acid molecule comprising: 

(a) a promoter sequence for directing protein expression in plant 
cells; 

(b) a heterologous nucleic acid sequence including: 

(i) a first sequence encoding a cellulose binding peptide; 

(ii) a second sequence encoding a recombinant protein, 
wherein said first and second sequences are joined 
together in frame; and 

(iii) a third sequence encoding a signal peptide for directing 
a protein to a cellular compartment, said third sequence 
being upstream and in frame with said first and second 
sequences; 

wherein, said heterologous nucleic acid sequence being down stream said 
promoter sequence, such that expression of said heterologous nucleic acid 
sequence is effectable by said promoter sequence. 

23. The nucleic acid molecule of claim 22, wherein said 
heterologous nucleic acid sequence further includes 

(iv) a fourth sequence encoding a unique amino acid 
sequence being recognizable and digestible by a 
protease or under predetermined cis or trans conditions 
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for digesting a controllable intervening protein 
sequence, said fourth sequence being between and in 
frame with said first and second sequences. 

24. The nucleic acid molecule of claim 22, further comprising a 
sequence element selected from the group consisting of an origin of 
replication for propagation in bacterial cells, at least one sequence element 
for integration into a plant's genome, a polyadenylation recognition 
sequence, a transcription termination signal, a sequence encoding a 
translation start site, a sequence encoding a translation stop site, plant RNA 
virus derived sequences, plant DNA virus derived sequences, tumor 
inducing (Ti) plasmid derived sequences, and a transposable element 
derived sequence. 

25. The nucleic acid molecule of claim 22, wherein said cellular 
compartment is selected from the group consisting of cytoplasm, 
endoplasmic reticulum, golgi apparatus, oil bodies, starch bodies, 
chloroplastids, chloroplasts, chromoplastids, chromoplasts, vacuole, 
lysosomes, mitochondria, and nucleus. 



ABSTRACT OF THE DISCLOSURE 
A process of expressing a recombinant protein in a plant and of 
isolating the recombinant protein from the plant, the process is effected by 
(a) providing a plant, a plant derived tissue or cultured plant cells 
expressing a fusion protein including the recombinant protein and a 
cellulose binding peptide being fused thereto, the fusion protein being 
compartmentalized within cells of the plant, plant derived tissue or cultured 
plant cells, so as to be sequestered from cell walls of the cells of the plant, 
plant derived tissue or cultured plant cells; (b) homogenizing the plant, plant 
derived tissue or cultured plant cells, so as to bring into contact the fusion 
protein with a cellulosic matter of the plant, plant derived tissue or cultured 
plant cells, to thereby effect affinity binding of the fusion protein via the 
cellulose binding peptide to the cellulosic matter, thereby obtaining a fusion 
protein cellulosic matter complex; and (c) isolating the fusion protein 
cellulosic matter complex. 
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Combined Declaration For Patent Application and Power of Attorney 



As a below named inventor, t hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name; 

i believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint 
inventor {if plural names are listed below) of the subject matter which is claimed and for which a patent is sought on the 
invention entitled process of expressing and isolating recombinant proteins and recombinant protein products 

FROM PLANTS. PLANT DERIVED TISSUES OR CULTURED PLANT CELLS , the Specification of which 

(check one) £3 is attached hereto 

□ was filed on as Application Serial No. and was amended on . I hereby state that I 
have reviewed and understand the contents of the above identified specification, including the claims, as amended by any 
amendment referred to above 

I acknowledge the duty to disclose information which is material to the patentability of this application in accordance 
with Title 37, Code of Federal Regulations. § 1 56(a) 

I hereby claim foreign priority benefits under Title 35, United States Code, § 1 19 of any foreign application(s) for 
patent or inventor's certificate listed below and have also identified below any foreign application for patent or inventor's 
certificate having filing date before that of the application on which priority is claimed: 



Prior Foreign Application(s) 
NONE 



□ 



(number) 



(Country) 



(Day, Month, Year Filed) 



(number) (Country) 



(Day, Month, Year Filed) 



(number) (Country) 



(Day, Month, Year Filed) 



Priority Claimed 

□ □ 
Yes 

□ 
Yes 

O 
Yes 



No 

□ 
No 

□ 
No 



i hereby claim the benefit under Title 35, United States Code, § 120 of any United States Appiication(s) listed 
bjefow and, insofar as the subject matter of each of the claims of this application is not disclosed in the prior United 
&|tes application in the manner provided by the first paragraph of Title 35, United States code, § 112, 1 acknowledge 
tfij duty to disclose material information as defined in Title 37, Code of Federal Regulations, § 1.56(a) which occurred 
between the filing date of the prior application and the national or PCT international filing date of this application: 



(Application Serial No.) 
(Application Serial No.) 



(Filing Date) Status 

(patented, pending, abandoned) 

(Filing Date) Status 

{patented, pending, abandoned) 



I hereby appoint the following attorneys, with full power of substitution, association, and revocation, to 
prosecute this application and to transact all business in the Patent and Trademark Office connected therewith. 

Mark M, Friedman Registration No. 33,883 

Address all Correspondence to: 



DR. MARK FRIEDMAN LTD. 

c/o ANTHONY CASTORINA 

2001 JEFFERSON DAVIS HIGHWAY 

SUITE 207 

ARLINGTON. VIRGINIA 22202 



Direct all telephone calls & faxes to: 
ROBERT SHEINBEIN 
Phone (703)415-1581 
Fax (703)415-4864 
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Continuation of Comoined Declaration For Patent Application and Power of Attorney 



I hereby further declare that all statements made herein of my own knowledge are true and thai a)) statements made on 
information and belief are believed to be true; and further that these statements were made with the knowledge that willfui false 
statements and the like so made are punishable by fine or imprisonment, or both, under Section 1001 of Tftle 18 of the United 
States Code and that such willful false statement may jeopardize the validity of the application of any patent issued thereon. 



| 'FULL NAME OF SOLE OR FIRST INVENTOR 
I ZIVSHANI 


| INVENTOR'S SIGNATURE 
...I 




I DATE, , 


1 
J 


| RESIDENCE 

I 4 PALDL 76248 REHOVOT ISRAEL 


{ CITIZENSHIP 
I ISRAELI 






1 
1 


(POST OFFICE ADDRESS ( 
I 4 PALDI. 76248 REHOVOT. ISRAEL I 



I "FULL NAME OF SECOND INVENTOR 
1 OPED SHOSEYOV 



) INVENTOR'S SIGNATURE 
J 



NSH1P Jj I 7 \ 



I RESIDENCE 

\ IS HAEREZ. 729 1 0 KARME YOSEF. ISRAEL 



CITIZENSHIP 
ISRAELI 



{ POST OFFICE ADDRESS 
b ^HAEREZ 72910 KARME YOSEF. ISRAEL 



I "FULL NAME OF FOURTH INVENTOR 

Lm 



| INVENTOR'S SIGNATURE 



| DATE 



I RESIDENCE 



J CITIZENSHIP 

J. ISRAELI 



| POST OFFICE ADDRESS 

I " 



i 'HULL NAME OF FIFTH INVENTOR 
L-LZ 



| INVENTOR S SIGNATURE 



DATE 



RESIDENCE 

5, i 



| CITIZENSHIP 
| ISRAELI 



| POST OFFICE ADDRESS 



| "FULL NAME OF SIXTH INVENTOR 
I 



| INVENTOR'S SIGNATURE 
J : 



I DATE 
J 



RESIDENCE 



| CITIZENSHIP 
I ISRAELI 



POST OFFICE ADDRESS 



| "FULL NAME OF SEVENTH INVENTOR 
I 



J INVENTOR'S SIGNATURE 
J 



DATE 



| RESIDENCE 
1 



CITIZENSHIP 
ISRAEL I , 



POST OFFICE ADDRESS 



SMALL BUSINESS CONCERN - NEW APPLICATION 

Attorney Docket No.. 1 079/5 



IN THE UNfTEP STATES PATENT AND TRADEMARK OFFICE 



In RE Application of. ZIVSHANI ETAL 
Filed Concurrently Herewith 



For PROCESS OF EXPRESSING AND ISOLATING RECOMBINANT PROTEINS AND RECOMBINANT PROTEIN PRODUCTS PROM 
PLANTS, PLANT DERIVED TISSUES OR CULTURED PLANT CELLS 



VERIFIED STATEMENT UNDER 37 CFR 1 27 
CLAIMING STATUS AS A SMALL ENTITY 



To The Commissioner of Patents and Trademarks: 



I hereby declare that 



I am the owner of. or an official empowered to act on behalf of, the small business concern identified below: 



Name of Concern: CBD TECHNOLOGIES LTD. 

^ Address : 2 PEKERIS. 4th FLOOR. PARK TAMAR P O BOX 199. 76100 REHOVOT. ISRAEL 

*.. 3 

{.sjThe smail business concern identified above, together with its affiliates, employs fewer than 500 persons and qualifies as a small business 
concern as defined in 37 CFR 1 .9(d) for purposes of paying reduced fees under 35 USC § 41(a) and § 41<b) to the Patent and Trademark Office 
wifrfregard to the above-entttied invention descnfced in the specification filed herewith. 

f If Rights under contract or law have been conveyed to and remain with the small business concern identified above with regard to the above 
entitled invention. 

ti! *lf the rights held by the small business concern are not exclusive, each other party having rights to the invention is listed below, and no rights 
to the invention are held by any party who could not qualify as a small entity under 37 CFR 1.9(0, namely any person who could not be classified 
asf|h independenl inventor under 37 CFR 1.9(c) if that person had made the invention, or any concern which would net qualify as a small 
bustnsss'concem under 37 CFR 1.9(d) or a nonprofit organization under 37 CFR 1.3(e). 

\A 

Q Full Name (Party 1 ) : NONE 

/- Address : „ 

Status □ Jr*dividu«t D Small Business Q Nonprofit 

Concern Organization 



Full Name (Party 2} 
Address 



Status □ Individual □ Small Business Q Nonprofit 

Concern Organization 
acknowledge the duty under 37 CFR 1.28(b) to file, m this application, notification of any change in status resulting in loss of entitlement to smafi 
entity status prior to paying, or at the time of paying, the issue tee due after the date on which status as a small entity is no longer appropriate. 

I hereby dedare that all statements made herein of my own knowledge are true and that all statements made on information and belief are 
believed to be true, and further that these statements were made with the Knowledge that willful False statements and the like so made are 
punishable by fine or imprisonment, or both. under^ctionX^ai df Title 18 of the United States Code, and that such willful false statements may 
jeopardize the validity of the application and any own^kaymg tHereon. 

"0ft. STPiNtF/ UlftSC tf- A/IMA / . ^^ t \W 

Name of Person Signing /S&nibire Date 



Capacity of Person Signing: 



Address of Person Signing : £ PCKER tS t ftEVtoVoT ( \ SftftQ 



** ^ 7. JUiN. 1999^ = 12: 45 ' <YISSUn Dai >* *H- 'Ait*'** 1 

972 3 5S25554 



NO. 515 P.2/2---^v t 



SMALL BUSINESS CONCERN - NEW APPLICATION 



Attorney Docket No.: 1079/s 



IN, TK5 UNITgp states patent and TftftPFMfV** Office 



in RE Application of: zivshani ft al. 
Filed Concurrently Herewfth 
Fnr, PROfiFflfi QP EXPRESSING AND I5QI 



L AT I NS 



RECOMBINANT P ROTEINS AMD RECOMBINANT PROTEIN PRODUCTS FROM" 



PLANTS. PLANT PEfifVED TISSUES OR CL>CTVRED>UNTJ^^ 

' V^R|P|qpSTATeMeWT UNDER 37 CFR 1.27 
CLAIMING STATUS AS A SMALL ENTITY 

To The Commissioner of Patents and Trademarks; 

I hereby declare that 

I am tht owner of. or an official empo*«r*d to act on bahalf of, the small business concern Identified b*low. . 
Name of Concern: YISSUM RESEARCH AND DEV E LOPMENT COMPANY OF THE HEBREW UNIVERSITY OF JERUSALEM 
3 Address ,46 JABOTINSKI. 91 043 JERUSALEM. ISRAEL 



and qualifies as a small business 
to the Patent and Trademark Offic* 



^The small business concern Identified above, together with its affiliates, employs fewer than SOO persons 
concern as defined in 37 CFR I.S(d) for purposes of paying reduced fees under 35 USC | *1<a) *nd f 41(b) b 
Yv»th=rjgard 1o the above-entitled invention described In the specification fil«d herewith. 

! Rights under contract or low have been conveyed to and remain with the small business concern Identified above with regard to the above 
entlijei Invention. 

If the rights rmid by the small business concern are not exclusive, oach other party having nghts to the Invention « listed below, and no rights 
to the trtventlon are held by any party who could not qualify as a small entity under 37 CFR 1.6(0, namely any person who could not be classified 
as sn&dspendent Inventor under 37 CFR t,P(c) If that peraon had med« the Invention, or any concern which weu|d not quarry as « small 
business concern under 37 CFR 1 .9(d) or a nonprofit organization under 37 CFR 1.9(e). 



Full Name (Party 1 ] : NQNS 

Address : — ' 

Status □ Individual 



□ Small Buslnasa 
Concern 



□ Nonprofit 
Organisation 



Full Name (Party 2) 
Address 



Status 



□ Individual □ Small Business . P Nonprofit 

Concern Organisation 
dmowledse the duty undsr 37 CFR 1 .28(b) to file, fn this application, notification of a ny change m status resulting In loss of entitlement to small 
entity status prior lo paying, or at the time of paying, the Issue fee due after the date on which statu* a* a email entity |s no longer appropriate. 



I hereby daelare that all statements mado herein ottn 
believed to be true; and further that these statamen$s v yfe/e 
punishable by fine or lmpnsonm«nt. or both, under Sec 
Jeopardize the validity of the application and any patent 



iy own knowledge art true an4 that *fl sittamants rnade on information and belief are 
made with tha Knowledge that willful false statements and the |iKe so made are 
iQ.01 of TWe IB of the United state* Cede, and that such willful false statements rrwy 
ifcsutng thereon. 



Nam* of Person Signing Signature 



Hfe *»*r«*KKl & T 2****" 
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32. 



Final Product 



Isolation of 
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